снартев 17

HOW MASS-ENERGY GENERATES CURVATURE

The physical world is represented as a four-dimensional continuum. If in this I adopt a Riemannian metric, and look for the simplest laws which such a metric can satisfy, I arrive at the relativistic gravitation theory of empty space. If I adopt in this space a vector field, or the antisymmetrical tensor field derived from it, and if I look for the simplest laws which such a field can satisfy, I arrive at the Maxwell equations for free space.
at any given moment, out of all conceivable constructions,
a single one has always proved itself absolutely
superior to all the rest
ALBERT EINSTEIN (1934, p. 18)

§17.1. AUTOMATIC CONSERVATION OF THE SOURCE AS THE CENTRAL IDEA IN THE FORMULATION OF THE FIELD EQUATION

This section derives the "Einstein field equation"
Turn now from the response of matter to geometry (motion of a neutral test particle on a geodesic; "comma-goes-to-semicolon rule" for the dynamics of matter and fields), and analyze the response of geometry to matter.
Mass is the source of gravity. The density of mass-energy as measured by any observer with 4 -velocity u u u\boldsymbol{u}u is
(17.1) ρ = u T u = u α T α β u β . (17.1) ρ = u T u = u α T α β u β . {:(17.1)rho=u*T*u=u^(alpha)T_(alpha beta)u^(beta).:}\begin{equation*} \rho=\boldsymbol{u} \cdot \boldsymbol{T} \cdot \boldsymbol{u}=u^{\alpha} T_{\alpha \beta} u^{\beta} . \tag{17.1} \end{equation*}(17.1)ρ=uTu=uαTαβuβ.
Therefore the stress-energy tensor T T T\boldsymbol{T}T is the frame-independent "geometric object" that must act as the source of gravity.
This source, this geometric object, is not an arbitrary symmetric tensor. It must have zero divergence
(17.2) T = 0 (17.2) T = 0 {:(17.2)grad*T=0:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{T}=0 \tag{17.2} \end{equation*}(17.2)T=0
because only so can the law of conservation of momentum-energy be upheld.
Place this source, T T T\boldsymbol{T}T, on the righthand side of the equation for the generation of gravity. On the lefthand side will stand a geometric object that characterizes gravity. That object, like T T T\boldsymbol{T}T, must be a symmetric, divergence-free tensor; and if it is to characterize gravity, it must be built out of the geometry of spacetime and nothing but that geometry. Give this object the name "Einstein tensor" and denote it by G G G\boldsymbol{G}G, so that the equation for the generation of gravity reads
(17.3) G = κ T . [ proportionality factor; to be evaluated later ] (17.3) G = κ T .  proportionality factor;   to be evaluated later  {:[(17.3)G=kappa T.],[ uarr[[" proportionality factor; "],[" to be evaluated later "]]]:}\begin{align*} \boldsymbol{G}= & \kappa \boldsymbol{T} . \tag{17.3}\\ & \uparrow\left[\begin{array}{c} \text { proportionality factor; } \\ \text { to be evaluated later } \end{array}\right] \end{align*}(17.3)G=κT.[ proportionality factor;  to be evaluated later ]
(Do not assume that G G G\boldsymbol{G}G is the same Einstein tensor as was encountered in Chapters 8 , 13 , 14 8 , 13 , 14 8,13,148,13,148,13,14, and 15 ; that will be proved below!)
The vanishing of the divergence G G grad*G\boldsymbol{\nabla} \cdot \boldsymbol{G}G is not to be regarded as a consequence of T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0. Rather, the obedience of all matter and fields to the conservation law T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 is to be regarded (1) as a consequence of the way [equation (17.3)] they are wired into the geometry of spacetime, and therefore (2) as required and enforced by an automatic conservation law, or identity, that holds for any smooth Riemannian spacetime whatsoever, physical or not: G 0 G 0 grad*G-=0\boldsymbol{\nabla} \cdot \boldsymbol{G} \equiv 0G0. (See Chapter 15 for a fuller discussion and $ 17.2 $ 17.2 $17.2\$ 17.2$17.2 below for a fuller justification.) Accordingly, look for a symmetric tensor G G G\boldsymbol{G}G that is an "automatically conserved measure of the curvature of spacetime" in the following sense:
(1) G G G\boldsymbol{G}G vanishes when spacetime is flat.
(2) G G G\boldsymbol{G}G is constructed from the Riemann curvature tensor and the metric, and from nothing else.
(3) G G G\boldsymbol{G}G is distinguished from other tensors which can be built from Riemann and g g g\boldsymbol{g}g by the demands (i) that it be linear in Riemann, as befits any natural measure of curvature; (ii) that, like T T T\boldsymbol{T}T, it be symmetric and of second rank; and (iii) that it have an automatically vanishing divergence,
(17.4) G 0 (17.4) G 0 {:(17.4)grad*G-=0:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{G} \equiv 0 \tag{17.4} \end{equation*}(17.4)G0
Apart from a multiplicative constant, there is only one tensor (exercise 17.1) that satisfies these requirements of being an automatically conserved, second-rank tensor, linear in the curvature, and of vanishing when spacetime is flat. It is the Einstein curvature tensor, G G G\boldsymbol{G}G, expressed in Chapter 8 in terms of the Ricci curvature tensor:
R μ ν = R α μ α ν (17.5) G μ ν = R μ ν 1 2 g μ ν R . R μ ν = R α μ α ν (17.5) G μ ν = R μ ν 1 2 g μ ν R . {:[R_(mu nu)=R^(alpha)_(mu alpha nu)],[(17.5)G_(mu nu)=R_(mu nu)-(1)/(2)g_(mu nu)R.]:}\begin{align*} R_{\mu \nu} & =R^{\alpha}{ }_{\mu \alpha \nu} \\ G_{\mu \nu} & =R_{\mu \nu}-\frac{1}{2} g_{\mu \nu} R . \tag{17.5} \end{align*}Rμν=Rαμαν(17.5)Gμν=Rμν12gμνR.
Equation describing how matter generates gravity must have form G = κ T G = κ T G=kappa T\boldsymbol{G}=\kappa \boldsymbol{T}G=κT, where T T T\boldsymbol{T}T is stress-energy tensor
Properties that the tensor G G G\boldsymbol{G}G must have
Proof that G G G\boldsymbol{G}G must be the Einstein curvature tensor of Chapter 8
This quantity was given vivid meaning in Chapter 15 as the "moment of rotation of the curvature" or, more simply, the "moment of rotation," constructed by taking the double-dual
(17.6a) G = Riemann (17.6a) G =  Riemann  {:(17.6a)G=^(**)" Riemann "^(**):}\begin{equation*} G={ }^{*} \text { Riemann }{ }^{*} \tag{17.6a} \end{equation*}(17.6a)G= Riemann 
of the Riemann curvature tensor, and then contracting this double dual,
(17.6b) G μ ν = G α μ α ν . (17.6b) G μ ν = G α μ α ν . {:(17.6b)G_(mu nu)=G^(alpha)_(mu alpha nu).:}\begin{equation*} G_{\mu \nu}=G^{\alpha}{ }_{\mu \alpha \nu} . \tag{17.6b} \end{equation*}(17.6b)Gμν=Gαμαν.
In Chapter 15 the vanishing of G G grad*G\boldsymbol{\nabla} \cdot \boldsymbol{G}G was shown to follow as a consequence of the elementary principle of topology that "the boundary of a boundary is zero."
To evaluate the proportionality constant κ κ kappa\kappaκ in the "Einstein field equation" G = κ T G = κ T G=kappa T\boldsymbol{G}=\kappa \boldsymbol{T}G=κT, one can compare with the well-tested Newtonian theory of gravity. To facilitate the comparison, examine the relative acceleration (geodesic deviation) of particles that fall down a pipe inserted into an idealized Earth of uniform density ρ ρ rho\rhoρ (Figure 1.12). According to Newton, the relative acceleration is governed by the density; according to Einstein, it is governed by the Riemann curvature of spacetime. Direct comparison of the Newtonian and Einstein predictions using Newtonian coordinates (where g μ ν η μ ν g μ ν η μ ν g_(mu nu)~~eta_(mu nu)g_{\mu \nu} \approx \eta_{\mu \nu}gμνημν ) reveals the relation
(17.7) R 00 R α 0 α 0 = 4 π ρ . (17.7) R 00 R α 0 α 0 = 4 π ρ . {:(17.7)R_(00)-=R^(alpha)_(0alpha0)=4pi rho.:}\begin{equation*} R_{00} \equiv R^{\alpha}{ }_{0 \alpha 0}=4 \pi \rho . \tag{17.7} \end{equation*}(17.7)R00Rα0α0=4πρ.
(See § 1.7 § 1.7 §1.7\S 1.7§1.7 for details of the derivation; see Chapter 12 for extensive discussion of Newtonian gravity using this equation.) When applied to the Earth's interior, the Einstein field equation G = κ T G = κ T G=kappa T\boldsymbol{G}=\kappa \boldsymbol{T}G=κT must thus reduce to R 00 = 4 π ρ R 00 = 4 π ρ R_(00)=4pi rhoR_{00}=4 \pi \rhoR00=4πρ. In component form, the Einstein field equation reads
G μ ν = R μ ν 1 2 g μ ν R = κ T μ ν . G μ ν = R μ ν 1 2 g μ ν R = κ T μ ν . G_(mu nu)=R_(mu nu)-(1)/(2)g_(mu nu)R=kappaT_(mu nu).G_{\mu \nu}=R_{\mu \nu}-\frac{1}{2} g_{\mu \nu} R=\kappa T_{\mu \nu} .Gμν=Rμν12gμνR=κTμν.
Its trace reads
R = R 2 R = κ T R = R 2 R = κ T -R=R-2R=kappa T-R=R-2 R=\kappa TR=R2R=κT
In consequence, it predicts
R 00 = 1 2 g 00 R + κ T 00 = 1 2 κ ( 2 T 00 g 00 1 T ) = 1 2 κ [ 2 T 00 + ( T 0 0 + T j j ) ] = 1 2 κ ( T 00 + T j j ) , R 00 = 1 2 g 00 R + κ T 00 = 1 2 κ ( 2 T 00 g 00 1 T ) = 1 2 κ 2 T 00 + T 0 0 + T j j = 1 2 κ T 00 + T j j , {:[R_(00)=(1)/(2)g_(00)R+kappaT_(00)=(1)/(2)kappa(2T_(00)-ubrace(g_(00)ubrace)_(-1)T)],[=(1)/(2)kappa[2T_(00)+(T_(0)^(0)+T_(j)^(j))]],[=(1)/(2)kappa(T_(00)+T_(j)^(j))","]:}\begin{aligned} R_{00} & =\frac{1}{2} g_{00} R+\kappa T_{00}=\frac{1}{2} \kappa(2 T_{00}-\underbrace{g_{00}}_{-1} T) \\ & =\frac{1}{2} \kappa\left[2 T_{00}+\left(T_{0}^{0}+T_{j}^{j}\right)\right] \\ & =\frac{1}{2} \kappa\left(T_{00}+T_{j}^{j}\right), \end{aligned}R00=12g00R+κT00=12κ(2T00g001T)=12κ[2T00+(T00+Tjj)]=12κ(T00+Tjj),
which reduces to
(17.8) R 00 = 1 2 κ ρ (17.8) R 00 = 1 2 κ ρ {:(17.8)R_(00)=(1)/(2)kappa rho:}\begin{equation*} R_{00}=\frac{1}{2} \kappa \rho \tag{17.8} \end{equation*}(17.8)R00=12κρ
when one recalls that for the Earth-as for any nearly Newtonian system-the stresses T j k T j k T_(jk)T_{j k}Tjk are very small compared to the density of mass-energy T 00 = ρ T 00 = ρ T_(00)=rhoT_{00}=\rhoT00=ρ :
| T j k | T 00 pressure density d p d ρ ( velocity of sound ) 2 1 . T j k T 00  pressure   density  d p d ρ (  velocity of sound  ) 2 1 . (|T_(jk)|)/(T_(00))∼(" pressure ")/(" density ")∼(dp)/(d rho)∼(" velocity of sound ")^(2)≪1.\frac{\left|T_{j k}\right|}{T_{00}} \sim \frac{\text { pressure }}{\text { density }} \sim \frac{d p}{d \rho} \sim(\text { velocity of sound })^{2} \ll 1 .|Tjk|T00 pressure  density dpdρ( velocity of sound )21.
The equation R 00 = 4 π ρ R 00 = 4 π ρ R_(00)=4pi rhoR_{00}=4 \pi \rhoR00=4πρ (derived by comparing relative accelerations in the Newton and Einstein theories) and the equation R 00 = 1 2 κ ρ R 00 = 1 2 κ ρ R_(00)=(1)/(2)kappa rhoR_{00}=\frac{1}{2} \kappa \rhoR00=12κρ (derived directly from the Einstein field equation) can agree only if the proportionality constant κ κ kappa\kappaκ is 8 π 8 π 8pi8 \pi8π.
Thus, the Einstein field equation, describing the generation of curvature by mass-energy, must read
(17.9) G = 8 π T (17.9) G = 8 π T {:(17.9)G=8pi T:}\begin{equation*} \boldsymbol{G}=8 \pi \boldsymbol{T} \tag{17.9} \end{equation*}(17.9)G=8πT
The lefthand side ("curvature") has units cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2, since a curvature tensor is a linear machine into which one inserts a displacement (units: cm ) and from which one gets a relative acceleration (units: cm / sec 2 cm / cm 2 cm 1 cm / sec 2 cm / cm 2 cm 1 cm//sec^(2)∼cm//cm^(2)∼cm^(-1)\mathrm{cm} / \mathrm{sec}^{2} \sim \mathrm{~cm} / \mathrm{cm}^{2} \sim \mathrm{~cm}^{-1}cm/sec2 cm/cm2 cm1 ). The right-hand side also has dimensions cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2, since it is a linear machine into which one inserts 4 -velocity (dimensionless) and from which one gets mass density [units: g / cm 3 cm / cm 3 g / cm 3 cm / cm 3 g//cm^(3)∼cm//cm^(3)∼\mathrm{g} / \mathrm{cm}^{3} \sim \mathrm{~cm} / \mathrm{cm}^{3} \simg/cm3 cm/cm3 cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2; recall from equation (1.12) and Box 1.8 that g = ( 1 g ) × ( G / c 2 ) = ( 1 g ) × g = ( 1 g ) × G / c 2 = ( 1 g ) × g=(1g)xx(G//c^(2))=(1g)xx\mathrm{g}=(1 \mathrm{~g}) \times\left(G / c^{2}\right)=(1 \mathrm{~g}) \timesg=(1 g)×(G/c2)=(1 g)× ( 0.742 × 10 28 cm / g ) = 0.742 × 10 28 cm ] 0.742 × 10 28 cm / g = 0.742 × 10 28 cm {:(0.742 xx10^(-28)(cm)//g)=0.742 xx10^(-28)(cm)]\left.\left(0.742 \times 10^{-28} \mathrm{~cm} / \mathrm{g}\right)=0.742 \times 10^{-28} \mathrm{~cm}\right](0.742×1028 cm/g)=0.742×1028 cm].
This concludes the simplest derivation of Einstein's field equation that has come to hand, and establishes its correspondence with the Newtonian theory of gravity under Newtonian conditions. That correspondence had to be worked out to determine the factor κ = 8 π κ = 8 π kappa=8pi\kappa=8 \piκ=8π on the righthand side of (17.9). Apart from this factor, the central point in the derivation was the demand for, and the existence of, a unique tensorial measure of curvature G G G\boldsymbol{G}G with an identically vanishing divergence.

Exercise 17.1. UNIQUENESS OF THE EINSTEIN TENSOR

EXERCISES

(a) Show that the most general second-rank, symmetric tensor constructable from Riemann and g g g\boldsymbol{g}g, and linear in Riemann, is
a R α β + b R g α β + Λ g α β (17.10) = a R μ α μ β + b R μ μ ν g α β + Λ g α β , a R α β + b R g α β + Λ g α β (17.10) = a R μ α μ β + b R μ μ ν g α β + Λ g α β , {:[aR_(alpha beta)+bRg_(alpha beta)+Lambdag_(alpha beta)],[(17.10)=aR^(mu)_(alpha mu beta)+bR^(mu_(mu nu))g_(alpha beta)+Lambdag_(alpha beta)","]:}\begin{gather*} a R_{\alpha \beta}+b R g_{\alpha \beta}+\Lambda g_{\alpha \beta} \\ =a R^{\mu}{ }_{\alpha \mu \beta}+b R^{\mu{ }_{\mu \nu}} g_{\alpha \beta}+\Lambda g_{\alpha \beta}, \tag{17.10} \end{gather*}aRαβ+bRgαβ+Λgαβ(17.10)=aRμαμβ+bRμμνgαβ+Λgαβ,
where a , b a , b a,ba, ba,b, and Λ Λ Lambda\LambdaΛ are constants.
(b) Show that this tensor has an automatically vanishing divergence if and only if b = 1 2 a b = 1 2 a b=-(1)/(2)ab=-\frac{1}{2} ab=12a.
(c) Show that, in addition, this tensor vanishes in flat spacetime, if and only if Λ = 0 Λ = 0 Lambda=0\Lambda=0Λ=0-i.e., if and only if it is a multiple of the Einstein tensor G α β = R α β 1 2 R g α β G α β = R α β 1 2 R g α β G_(alpha beta)=R_(alpha beta)-(1)/(2)Rg_(alpha beta)G_{\alpha \beta}=R_{\alpha \beta}-\frac{1}{2} R g_{\alpha \beta}Gαβ=Rαβ12Rgαβ. (Do not bother to prove that G 0 G 0 grad*G-=0\boldsymbol{\nabla} \cdot \boldsymbol{G} \equiv 0G0; assume it as a result from Chapter 13.)

Exercise 17.2. NO TENSOR CONSTRUCTABLE FROM FIRST DERIVATIVES OF METRIC

Show that there exists no tensor with components constructable from the ten metric coefficients g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ and their 40 first derivatives g α β , μ g α β , μ g_(alpha beta,mu)g_{\alpha \beta, \mu}gαβ,μ-except the metric tensor g g g\boldsymbol{g}g, and products of it with itself; e.g., g g g g g ox g\boldsymbol{g} \otimes \boldsymbol{g}gg. [Hint: Assume there exists some other such tensor, and examine its hypothesized components in a local inertial frame.]
Result: "Einstein field equation" G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT

Exercise 17.3. RIEMANN AS THE ONLY TENSOR CONSTRUCTABLE FROM, AND LINEAR IN SECOND DERIVATIVES OF METRIC

Show that (1) Riemann, (2) g, and (3) tensors (e.g., Ricci) formed from Riemann and g g g\boldsymbol{g}g but linear in Riemann, are the only tensors that (a) are constructable from the ten g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ, the 40 g α β , μ 40 g α β , μ 40g_(alpha beta,mu)40 g_{\alpha \beta, \mu}40gαβ,μ, and the 100 g α β , μ ν 100 g α β , μ ν 100g_(alpha beta,mu nu)100 g_{\alpha \beta, \mu \nu}100gαβ,μν, and (b) are linear in the g α β , μ ν g α β , μ ν g_(alpha beta,mu nu)g_{\alpha \beta, \mu \nu}gαβ,μν. [Hint: Assume there exists some other such tensor, and examine its hypothesized components in an orthonormal, Riemann-normal coordinate system. Use equations (11.30) to (11.32).]

Exercise 17.4. UNIQUENESS OF THE EINSTEIN TENSOR

(a) Show that the Einstein tensor, G α β = R α β 1 2 R g α β G α β = R α β 1 2 R g α β G_(alpha beta)=R_(alpha beta)-(1)/(2)Rg_(alpha beta)G_{\alpha \beta}=R_{\alpha \beta}-\frac{1}{2} R g_{\alpha \beta}Gαβ=Rαβ12Rgαβ, is the only second-rank, symmetric tensor that (1) has components constructable solely from g α β , g α β , μ , g α β , μ ν g α β , g α β , μ , g α β , μ ν g_(alpha beta),g_(alpha beta,mu),g_(alpha beta,mu nu)g_{\alpha \beta}, g_{\alpha \beta, \mu}, g_{\alpha \beta, \mu \nu}gαβ,gαβ,μ,gαβ,μν; (2) has components linear in g α β , μ ν g α β , μ ν g_(alpha beta,mu nu)g_{\alpha \beta, \mu \nu}gαβ,μν; (3) has an automatically vanishing divergence, G = 0 G = 0 grad*G=0\boldsymbol{\nabla} \cdot \boldsymbol{G}=0G=0; and (4) vanishes in flat spacetime. This provides added motivation for choosing the Einstein tensor as the left side of the field equation G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT.
(b) Show that, when condition (4) is dropped, the most general tensor is G + Λ g G + Λ g G+Lambda g\boldsymbol{G}+\Lambda \boldsymbol{g}G+Λg, where Λ Λ Lambda\LambdaΛ is a constant. (See § 17.3 § 17.3 §17.3\S 17.3§17.3 for the significance of this.)
Einstein field equation governs the evolution of spacetime geometry

§17.2. AUTOMATIC CONSERVATION OF THE SOURCE: A DYNAMIC NECESSITY

The answer G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT is now on hand; but what is the question? An equation has been derived that connects the Einstein-Cartan "moment of rotation" G G G\boldsymbol{G}G with the stress-energy tensor T T T\boldsymbol{T}T, but what is the purpose for which one wants this equation in the first place? If geometry tells matter how to move, and matter tells geometry how to curve, does one not have in one's hands a Gordian knot? And how then can one ever untie it?
The story is no different in character for the dynamics of geometry than it is for other branches of dynamics. To predict the future, one must first specify, on an "initial" hypersurface of "simultaneity," the position and velocity of every particle, and the amplitude and time-rate of change of every field that obeys a second-order wave equation. One can then evolve the particles and fields forward in time by means of their dynamic equations. Similarly, one must give information about the geometry and its first time-rate of change on the "initial" hypersurface if the Einstein field equation is to be able to predict completely and deterministically the future timedevelopment of the entire system, particles plus fields plus geometry. (See Chapter 21 for details.)
If a prediction is to be made of the geometry, how much information has to be supplied for this purpose? The geometry of spacetime is described by the metric
d s 2 = g α β ( P ) d x α d x β d s 2 = g α β ( P ) d x α d x β ds^(2)=g_(alpha beta)(P)dx^(alpha)dx^(beta)d s^{2}=g_{\alpha \beta}(\mathscr{P}) d x^{\alpha} d x^{\beta}ds2=gαβ(P)dxαdxβ
that is, by the ten functions g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ of location P P P\mathscr{P}P in spacetime. It might then seem that ten functions must be predicted; and, if so, that one would need for the task ten
equations. Not so. Introduce a new set of coordinates x μ ¯ x μ ¯ x^( bar(mu))x^{\bar{\mu}}xμ¯ by way of the coordinate transformations
x α = x α ( x μ ¯ ) x α = x α x μ ¯ x^(alpha)=x^(alpha)(x^( bar(mu)))x^{\alpha}=x^{\alpha}\left(x^{\bar{\mu}}\right)xα=xα(xμ¯)
and find the same spacetime geometry, with all the same bumps, rills, and waves, described by an entirely new set of metric coefficients g α ¯ β ¯ ( P ) g α ¯ β ¯ ( P ) g_( bar(alpha) bar(beta))(P)g_{\bar{\alpha} \bar{\beta}}(\mathscr{P})gα¯β¯(P).
It would transgress the power as well as the duty of Einstein's "geometrodynamic law" G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT if, out of the appropriate data on the "initial-value hypersurface," it were to provide a way to calculate, on out into the future, values for all ten functions g α β ( P ) g α β ( P ) g_(alpha beta)(P)g_{\alpha \beta}(\mathscr{P})gαβ(P). To predict all ten functions would presuppose a choice of the coordinates; and to make a choice among coordinate systems is exactly what the geometrodynamic law cannot and must not have the power to do. That choice resides of necessity in the man who studies the geometry, not in the Nature that makes the geometry. The geometry in and by itself, like an automobile fender in and by itself, is free of coordinates. The coordinates are the work of man.
It follows that the ten components G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ of the field equation must not determine completely and uniquely all ten components g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν of the metric. On the contrary, G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ must place only six independent constraints on the ten g μ ν ( P ) g μ ν ( P ) g_(mu nu)(P)g_{\mu \nu}(\mathscr{P})gμν(P), leaving four arbitrary functions to be adjusted by man's specialization of the four coordinate functions x α ( P ) x α ( P ) x^(alpha)(P)x^{\alpha}(\mathscr{P})xα(P).
How can this be so? How can the ten equations G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ be in reality only six? Answer: by virtue of the "automatic conservation of the source." More specifically, the identity G α β ; β 0 G α β ; β 0 G^(alpha beta)_(;beta)-=0G^{\alpha \beta}{ }_{; \beta} \equiv 0Gαβ;β0 guarantees that the ten equations G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ contain the four "conservation laws" T α β ; β = 0 T α β ; β = 0 T^(alpha beta)_(;beta)=0T^{\alpha \beta}{ }_{; \beta}=0Tαβ;β=0. These four conservation laws-along with other equations-govern the evolution of the source. They do not constrain in any way the evolution of the geometry. The geometry is constrained only by the six remaining, independent equations in G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ.
When viewed in this way, the "automatic conservation of the source" is not merely a philosophically attractive principle. It is, in fact, an absolute dynamic necessity. Without "automatic conservation of the source," the ten G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ would place ten constraints on the ten g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ, thus fixing the coordinate system as well as the geometry. With "automatic conservation," the ten G α β = 8 π T α β G α β = 8 π T α β G_(alpha beta)=8piT_(alpha beta)G_{\alpha \beta}=8 \pi T_{\alpha \beta}Gαβ=8πTαβ place four constraints (local conservation of energy and momentum) on the source, and six constraints on the ten g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ, leaving four of the g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ to be adjusted by adjustment of the coordinate system.

§17.3. COSMOLOGICAL CONSTANT

In 1915, when Einstein developed his general relativity theory, the permanence of the universe was a fixed item of belief in Western philosophy. "The heavens endure from everlasting to everlasting." Thus, it disturbed Einstein greatly to discover (Chapter 27) that his geometrodynamic law G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT predicts a nonpermanent universe; a dynamic universe; a universe that originated in a "big-bang" explosion,
Einstein's motivation for introducing a cosmological constant
or will be destroyed eventually by contraction to infinite density, or both. Faced with this contradiction between his theory and the firm philosophical beliefs of the day, Einstein weakened; he modified his theory.
The only conceivable modification that does not alter vastly the structure of the theory is to change the lefthand side of the geometrodynamic law G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT. Recall that the lefthand side is forced to be the Einstein tensor, G α β = R α β 1 2 R g α β G α β = R α β 1 2 R g α β G_(alpha beta)=R_(alpha beta)-(1)/(2)Rg_(alpha beta)G_{\alpha \beta}=R_{\alpha \beta}-\frac{1}{2} R g_{\alpha \beta}Gαβ=Rαβ12Rgαβ, by three assumptions:
(1) G G G\boldsymbol{G}G vanishes when spacetime is flat.
(2) G G G\boldsymbol{G}G is constructed from the Riemann curvature tensor and the metric and nothing else.
(3) G G G\boldsymbol{G}G is distinguished from other tensors that can be built from Riemann and g g g\boldsymbol{g}g by the demands (1) that it be linear in Riemann, as befits any natural measure of curvature; (2) that, like T T T\boldsymbol{T}T, it be symmetric and of second rank; and (3) that it have an automatically vanishing divergence, G 0 G 0 grad*G-=0\boldsymbol{\nabla} \cdot \boldsymbol{G} \equiv 0G0.
Denote a new, modified lefthand side by " G G G\mathbf{G}G ", with quotation marks to avoid confusion with the standard Einstein tensor. To abandon grad*\boldsymbol{\nabla} \cdot " G G G\boldsymbol{G}G " 0 0 -=0\equiv 00 is impossible on dynamic grounds (see §17.2). To change the symmetry or rank of " G G G\boldsymbol{G}G " is impossible on mathematical grounds, since " G G G\boldsymbol{G}G " must be equated to T T T\boldsymbol{T}T. To let " G G G\boldsymbol{G}G " be nonlinear in Riemann would vastly complicate the theory. To construct " G G G\boldsymbol{G}G " from anything except Riemann and g g g\boldsymbol{g}g would make " G G G\boldsymbol{G}G " no longer a measure of spacetime geometry and would thus violate the spirit of the theory. After much anguish, one concludes that the assumption which one might drop with least damage to the beauty and spirit of the theory is assumption (1), that " G G G\boldsymbol{G}G " vanish when spacetime is flat. But even dropping this assumption is painful: (1) although " G G G\boldsymbol{G}G " might still be in some sense a measure of geometry, it can no longer be a measure of curvature; and (2) flat, empty spacetime will no longer be compatible with the geometrodynamic law ( G 0 G 0 G!=0\boldsymbol{G} \neq 0G0 in flat, empty space, where T = 0 T = 0 T=0\boldsymbol{T}=0T=0 ). Nevertheless, these consequences were less painful to Einstein than a dynamic universe.
The only tensor that satisfies conditions (2) and (3) [with (1) abandoned] is the Einstein tensor plus a multiple of the metric:
" G α β " = R α β 1 2 g α β R + Λ g α β = G α β + Λ g α β " G α β " = R α β 1 2 g α β R + Λ g α β = G α β + Λ g α β "G_(alpha beta)"=R_(alpha beta)-(1)/(2)g_(alpha beta)R+Lambdag_(alpha beta)=G_(alpha beta)+Lambdag_(alpha beta)" G_{\alpha \beta} "=R_{\alpha \beta}-\frac{1}{2} g_{\alpha \beta} R+\Lambda g_{\alpha \beta}=G_{\alpha \beta}+\Lambda g_{\alpha \beta}"Gαβ"=Rαβ12gαβR+Λgαβ=Gαβ+Λgαβ
(exercise 17.1; see also exercise 17.4). Thus was Einstein (1917) led to his modified field equation
(17.11) G + Λ g = 8 π T . (17.11) G + Λ g = 8 π T . {:(17.11)G+Lambda g=8pi T.:}\begin{equation*} \boldsymbol{G}+\Lambda \boldsymbol{g}=8 \pi \boldsymbol{T} . \tag{17.11} \end{equation*}(17.11)G+Λg=8πT.
The constant Λ Λ Lambda\LambdaΛ he called the "cosmological constant"; it has dimensions cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2.
The modified field equation, by contrast with the original, admits a static, unchanging universe as one particular solution (see Box 27.5). For this reason, Einstein in 1917 was inclined to place his faith in the modified equation. But thirteen years later Hubble discovered the expansion of the universe. No longer was the cosmological constant necessary. Einstein, calling the cosmological constant "the biggest
Why Einstein abandoned the cosmological constant
Einstein's field equation with the cosmological constant
blunder of my life," abandoned it and returned to his original geometrodynamic law, G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT [Einstein (1970)].
A great mistake Λ Λ Lambda\LambdaΛ was indeed!-not least because, had Einstein stuck by his original equation, he could have claimed the expansion of the universe as the most triumphant prediction of his theory of gravity.
A mischievous genie, once let out of a bottle, is not easily reconfined. Many workers in cosmology are unwilling to abandon the cosmological constant. They insist that it be abandoned only after cosmological observations reveal it to be negligibly small. As a modern-day motivation for retaining the cosmological constant, one sometimes rewrites the modified field equation in the form
(17.12a) G = 8 π [ T + T ( VAC ) ] (17.12b) T ( VAC ) ( Λ / 8 π ) g (17.12a) G = 8 π T + T ( VAC ) (17.12b) T ( VAC ) ( Λ / 8 π ) g {:[(17.12a)G=8pi[T+T^((VAC))]],[(17.12b)T^((VAC))-=-(Lambda//8pi)g]:}\begin{gather*} \boldsymbol{G}=8 \pi\left[\boldsymbol{T}+\boldsymbol{T}^{(\mathrm{VAC})}\right] \tag{17.12a}\\ \boldsymbol{T}^{(\mathrm{VAC})} \equiv-(\Lambda / 8 \pi) \boldsymbol{g} \tag{17.12b} \end{gather*}(17.12a)G=8π[T+T(VAC)](17.12b)T(VAC)(Λ/8π)g
and interprets T (vaC) T (vaC)  T^((vaC) )\boldsymbol{T}^{\text {(vaC) }}T(vaC)  as a stress-energy tensor associated with the vacuum. This viewpoint speculates [Zel'dovich (1967)] that the vacuum polarization of quantum field theory endows the vacuum with the nonzero stress-energy tensor (17.12b), which is completely unobservable except by its gravitational effects. Unfortunately, today's quantum field theory is too primitive to allow a calculation of T ( VAC ) T ( VAC ) T^((VAC))\boldsymbol{T}^{(\mathrm{VAC})}T(VAC) from first principles. (See, however, exercise 17.5.)
The mass-energy density that the cosmological constant attributes to the vacuum is
(17.13) ρ ( VAC ) = T 0 ^ 0 ^ ( VAC ) = + Λ / 8 π (17.13) ρ ( VAC ) = T 0 ^ 0 ^ ( VAC ) = + Λ / 8 π {:(17.13)rho^((VAC))=T_( hat(0) hat(0))^((VAC))=+Lambda//8pi:}\begin{equation*} \rho^{(\mathrm{VAC})}=T_{\hat{0} \hat{0}}^{(\mathrm{VAC})}=+\Lambda / 8 \pi \tag{17.13} \end{equation*}(17.13)ρ(VAC)=T0^0^(VAC)=+Λ/8π
If Λ 0 Λ 0 Lambda!=0\Lambda \neq 0Λ0, it must at least be so small that ρ ( VAC ) ρ ( VAC ) rho^((VAC))\rho^{(\mathrm{VAC})}ρ(VAC) has negligible gravitational effects [ ρ ( VAC ) ∣< ρ ( MATTER ) ] ρ ( VAC ∣< ρ ( MATTER  ) {:[∣rho^((VAC))∣<rho^(("MATTER "))]\left.\left[\mid \rho^{(\mathrm{VAC}}\right) \mid<\rho^{(\text {MATTER })}\right][ρ(VAC)∣<ρ(MATTER )] wherever Newton's theory of gravity gives a successful account of observations. The systems of lowest density to which one applies Newtonian theory with some (though not great) success are small clusters of galaxies. Hence, one can place the limit
(17.14) | ρ ( VAC ) | = | Λ | / 8 π ρ ( CLUSTER ) 10 29 g / cm 3 10 57 cm 2 (17.14) ρ ( VAC ) = | Λ | / 8 π ρ ( CLUSTER ) 10 29 g / cm 3 10 57 cm 2 {:(17.14)|rho^((VAC))|=|Lambda|//8pi <= rho^((CLUSTER))∼10^(-29)g//cm^(3)∼10^(-57)cm^(-2):}\begin{equation*} \left|\rho^{(\mathrm{VAC})}\right|=|\Lambda| / 8 \pi \leqq \rho^{(\mathrm{CLUSTER})} \sim 10^{-29} \mathrm{~g} / \mathrm{cm}^{3} \sim 10^{-57} \mathrm{~cm}^{-2} \tag{17.14} \end{equation*}(17.14)|ρ(VAC)|=|Λ|/8πρ(CLUSTER)1029 g/cm31057 cm2
on the value of the cosmological constant. Evidently, even if Λ 0 , Λ Λ 0 , Λ Lambda!=0,Lambda\Lambda \neq 0, \LambdaΛ0,Λ is so small that it is totally unimportant on the scale of a galaxy or a star or a planet or a man or an atom. Consequently it is reasonable to stick with Einstein's original geometrodynamic law ( G = 8 π T ; Λ = 0 G = 8 π T ; Λ = 0 G=8pi T;Lambda=0\boldsymbol{G}=8 \pi \boldsymbol{T} ; \Lambda=0G=8πT;Λ=0 ) everywhere, except occasionally when discussing cosmology (Chapters 27-30).
Exercise 17.5. MAGNITUDE OF COSMOLOGICAL CONSTANT
(a) What is the order of magnitude of the influence of the cosmological constant on the celestial mechanics of the solar system if Λ 10 57 cm 2 Λ 10 57 cm 2 Lambda∼10^(-57)cm^(-2)\Lambda \sim 10^{-57} \mathrm{~cm}^{-2}Λ1057 cm2 ?
A modern-day motivation for the cosmological constant: vacuum polarization
Observational limit on the cosmological constant
Why one ignores the cosmological constant everywhere except in cosmology
(b) Show that the mass-energy density of the vacuum ρ ( VAC ) = Λ / 8 π 10 29 g / cm 3 ρ ( VAC ) = Λ / 8 π 10 29 g / cm 3 rho^((VAC))=Lambda//8pi∼10^(-29)g//cm^(3)\rho^{(\mathrm{VAC})}=\Lambda / 8 \pi \sim 10^{-29} \mathrm{~g} / \mathrm{cm}^{3}ρ(VAC)=Λ/8π1029 g/cm3, corresponding to the maximum possible value of Λ Λ Lambda\LambdaΛ, agrees in very rough magnitude with
rest mass of an elementary particle ( Compton wavelength of particle) ×  rest mass of an elementary particle  (  Compton wavelength of particle)  × (" rest mass of an elementary particle ")/((" Compton wavelength of particle) ")xx\frac{\text { rest mass of an elementary particle }}{(\text { Compton wavelength of particle) }} \times rest mass of an elementary particle ( Compton wavelength of particle) × (gravitational fine-structure constant)
= m ( / m ) 3 m 2 = m 6 4 = m ( / m ) 3 m 2 = m 6 4 =(m)/((ℏ//m)^(3))(m^(2))/(ℏ)=(m^(6))/(ℏ^(4))=\frac{m}{(\hbar / m)^{3}} \frac{m^{2}}{\hbar}=\frac{m^{6}}{\hbar^{4}}=m(/m)3m2=m64
[Zel'dovich ( 1967 , 1968 ) ( 1967 , 1968 ) (1967,1968)(1967,1968)(1967,1968) ]. This numerology is suggestive, but has not led to any believable derivation of a stress-energy tensor for the vacuum.

§17.4. THE NEWTONIAN LIMIT

Just as quantum mechanics reduces to classical mechanics in the "correspondence limit" of large actions, I I I≫ℏI \gg \hbarI, so general relativity reduces to Newtonian theory in the "correspondence limit" of weak gravity and low velocities. (On "correspondence limits," see Box 17.1.) This section elucidates, in some mathematical detail, the correspondence between general relativity and Newtonian theory. It begins with "passive" aspects of gravitation (response of matter to gravity) and then turns to "active" aspects (generation of gravity by matter).
Consider an isolated system-e.g., the solar system-in which Newtonian theory is highly accurate. In order that special relativistic effects not be noticeable, all

Box 17.1 CORRESPONDENCE PRINCIPLES

A. General Remarks and Specific Examples

  1. As physics develops and expands, its unity is maintained by a network of correspondence principles, through which simpler theories maintain their vitality by links to more sophisticated but more accurate ones.
    a. Physical optics, with all the new diffraction and interference phenomena for which it accounted, nevertheless also had to account, and did account, for the old, elementary, geometric optics of mirrors and lenses. Geometric optics is recovered from physical optics in the mathematical "correspondence
    principle limit" in which the wavelength is made indefinitely small in comparison with all other relevant dimensions of the physical system.
    b. Newtonian mechanics is recovered from the mechanics of special relativity in the mathematical "correspondence principle limit" in which all relevant velocities are negligibly small compared to the speed of light.
    c. Thermodynamics is recovered from its successor theory, statistical mechanics, in the mathematical "correspondence principle limit" in which so many particles are taken into account that fluctuations in pressure,
    particle number, and other physical quantities are negligible compared to the average values of these parameters of the system.
    d. Classical mechanics is recovered from quantum mechanics in the "correspondence principle limit" in which the quantum numbers of the quantum states in question are so large, or the quantities of action that come into play are so great compared to \hbar, that wave and diffraction phenomena make negligible changes in the predictions of standard deterministic classical mechanics. Niels Bohr formulated and took advantage of this correspondence principle even before any proper quantum theory existed. He used it to predict approximate values of atomic energy levels and of intensities of spectral lines. He also expounded it as a guide to all physicists, first in searching for a proper version of the quantum theory, and then in elucidating the content of this theory after it was found.
  2. In all these examples and others, the newer, more sophisticated theory is "better" than its predecessor because it gives a good description of a more extended domain of physics, or a more accurate description of the same domain, or both.
  3. The correspondence between the newer theory and its predecessor (a) gives one the power to recover the older theory from the newer; (b) can be exhibited by straightforward mathematics; and (c), according to the historical record, often guided the development of the newer theory.

B. Correspondence Structure of General Relativity

  1. Einstein's theory of gravity has as distinct limiting cases (a) special relativity; (b) the "linear-
    ized theory of gravity"; (c) Newton's theory of gravity; and (d) the post-Newtonian theory of gravity. Thus, it has a particularly rich correspondence structure.
    a. Correspondence with special relativity: General relativity has two distinct kinds of correspondence with special relativity. The first is the limit of vanishing gravitational field everywhere (vanishing curvature); in this limit one can introduce a global inertial frame, set g μ ν = η μ ν g μ ν = η μ ν g_(mu nu)=eta_(mu nu)g_{\mu \nu}=\eta_{\mu \nu}gμν=ημν, and recover completely and precisely the theory of special relativity. The second is local rather than global; it is the demand ("correspondence principle"; "equivalence principle") that in a local inertial frame all the laws of physics take on their special relativistic forms. As was seen in Chapter 16, this puts no restrictions on the metric (except that g μ ν = η μ ν g μ ν = η μ ν g_(mu nu)=eta_(mu nu)g_{\mu \nu}=\eta_{\mu \nu}gμν=ημν and g μ ν , α = 0 g μ ν , α = 0 g_(mu nu,alpha)=0g_{\mu \nu, \alpha}=0gμν,α=0 in local inertial frames); but it places severe constraints on the behavior of matter and fields in the presence of gravity.
    b. Correspondence with Newtonian theory: In the limit of weak gravitational fields, low velocities, and small pressures, general relativity reduces to Newton's theory of gravity. The correspondence structure is explored mathematically in the text of § 17.4 § 17.4 §17.4\S 17.4§17.4.
    c. Correspondence with post-Newtonian theory: When Newtonian theory is nearly valid, but "first-order relativistic corrections" might be important, one often uses the "post-Newtonian theory of gravity." Chapter 39 expounds the post-Newtonian theory and its correspondence with both general relativity and Newtonian theory.
    d. Correspondence with linearized theory: In the limit of weak gravitational fields, but possibly large velocities and pressures ( v 1 v 1 v∼1v \sim 1v1, T j k T 00 T j k T 00 T_(jk)∼T_(00)T_{j k} \sim T_{00}TjkT00 ) general relativity reduces to the "linearized theory of gravity". This correspondence is explored in Chapter 18.
Conditions which a system must satisfy for Newton's theory of gravity to be accurate
"Newtonian coordinates" defined
velocities in the system, relative to its center of mass and also relative to the Newtonian coordinates, must be small compared to the speed of light
(17.15a) v 1 . (17.15a) v 1 . {:(17.15a)v≪1.:}\begin{equation*} v \ll 1 . \tag{17.15a} \end{equation*}(17.15a)v1.
As a particle falls from the outer region of the system to the inner region, gravity accelerates it to a kinetic energy 1 2 m v 2 | m Φ | max 1 2 m v 2 | m Φ | max (1)/(2)mv^(2)∼|m Phi|_(max)\frac{1}{2} m v^{2} \sim|m \Phi|_{\max }12mv2|mΦ|max. [Here Φ < 0 Φ < 0 Phi < 0\Phi<0Φ<0 is Newton's gravitational potential, so normalized that Φ ( ) = 0 Φ ( ) = 0 Phi(oo)=0\Phi(\infty)=0Φ()=0.] The resulting velocity will be small only if
(17.15b) | Φ | 1 (17.15b) | Φ | 1 {:(17.15b)|Phi|≪1:}\begin{equation*} |\Phi| \ll 1 \tag{17.15b} \end{equation*}(17.15b)|Φ|1
Internal stresses in the system also produce motion-e.g., sound waves. Such waves have characteristic velocities of the order of | T i j / T 00 | 1 / 2 T i j / T 00 1 / 2 |T^(ij)//T^(00)|^(1//2)\left|T^{i j} / T^{00}\right|^{1 / 2}|Tij/T00|1/2-for example, the speed of sound in a perfect fluid is
v = ( d p / d ρ ) 1 / 2 ( p / ρ ) 1 / 2 | T i j / T 00 | 1 / 2 . v = ( d p / d ρ ) 1 / 2 ( p / ρ ) 1 / 2 T i j / T 00 1 / 2 . v=(dp//d rho)^(1//2)∼(p//rho)^(1//2)∼|T^(ij)//T^(00)|^(1//2).v=(d p / d \rho)^{1 / 2} \sim(p / \rho)^{1 / 2} \sim\left|T^{i j} / T^{00}\right|^{1 / 2} .v=(dp/dρ)1/2(p/ρ)1/2|Tij/T00|1/2.
In order that these velocities be small compared to the speed of light, all stresses must be small compared to the density of mass-energy
(17.15c) | T i j | / T 00 = | T i j | / ρ 1 (17.15c) T i j / T 00 = T i j / ρ 1 {:(17.15c)|T^(ij)|//T^(00)=|T^(ij)|//rho≪1:}\begin{equation*} \left|T^{i j}\right| / T^{00}=\left|T^{i j}\right| / \rho \ll 1 \tag{17.15c} \end{equation*}(17.15c)|Tij|/T00=|Tij|/ρ1
When, and only when conditions (17.15) hold, one can expect Newtonian theory to describe accurately the system being studied. Correspondence of general relativity with Newtonian theory for gravity in a passive role then demands that the geodesic world lines of freely falling particles reduce to the Newtonian world lines
(17.16) d 2 x i / d t 2 = Φ / x i (17.16) d 2 x i / d t 2 = Φ / x i {:(17.16)d^(2)x^(i)//dt^(2)=-del Phi//delx^(i):}\begin{equation*} d^{2} x^{i} / d t^{2}=-\partial \Phi / \partial x^{i} \tag{17.16} \end{equation*}(17.16)d2xi/dt2=Φ/xi
Moreover, they must reduce to this form in any relativistic coordinate system where the source and test particles have low velocities v 1 v 1 v≪1v \ll 1v1, and where coordinate lengths and times agree very nearly with the lengths and times of the Newtonian coor-dinates-which in turn are proper lengths and times as measured by rods and clocks. Thus, the relevant coordinates (called "Galilean" or "Newtonian" coordinates) are ones in which
(17.17) g μ ν = η μ ν + h μ ν , | h μ ν | 1 , | v j | = | d x j / d t | 1 (17.17) g μ ν = η μ ν + h μ ν , h μ ν 1 , v j = d x j / d t 1 {:(17.17)g_(mu nu)=eta_(mu nu)+h_(mu nu)","quad|h_(mu nu)|≪1","quad|v^(j)|=|dx^(j)//dt|≪1:}\begin{equation*} g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}, \quad\left|h_{\mu \nu}\right| \ll 1, \quad\left|v^{j}\right|=\left|d x^{j} / d t\right| \ll 1 \tag{17.17} \end{equation*}(17.17)gμν=ημν+hμν,|hμν|1,|vj|=|dxj/dt|1
(weak gravitational field; nearly inertial coordinates; low velocities). In such a coordinate system, the geodesic world lines of test particles have the form
d 2 x i d t 2 = d 2 x i d τ 2 ( since d t / d τ 1 when | h μ ν | 1 and | v j | 1 ) = Γ i α β d x α d τ d x β d τ (geodesic equation) = Γ i 00 (since d t / d τ 1 and | d x j / d τ | 1 ) = Γ i 00 ( since g μ ν η μ ν ) = 1 2 h 00 , i h 0 i , 0 (equation for Γ α β γ in terms of g α β , γ ) = 1 2 h 00 , i ( all velocities small compared to c implies time derivatives small compared to space derivatives i.e., h α β , 0 v h α β , i ) . d 2 x i d t 2 = d 2 x i d τ 2  since  d t / d τ 1  when  h μ ν 1  and  v j 1 = Γ i α β d x α d τ d x β d τ  (geodesic equation)  = Γ i 00  (since  d t / d τ 1  and  d x j / d τ 1 = Γ i 00  since  g μ ν η μ ν = 1 2 h 00 , i h 0 i , 0  (equation for  Γ α β γ  in terms of  g α β , γ = 1 2 h 00 , i  all velocities small compared to  c  implies time   derivatives small compared to space derivatives   i.e.,  h α β , 0 v h α β , i . {:[(d^(2)x^(i))/(dt^(2)),=(d^(2)x^(i))/(dtau^(2)),(" since "dt//d tau~~1" when "|h_(mu nu)|≪1" and "|v^(j)|≪1)],[,=-Gamma^(i)_(alpha beta)(dx^(alpha))/(d tau)(dx^(beta))/(d tau),quad" (geodesic equation) "],[,{:=-Gamma^(i)_(00)quad" (since "dt//d tau~~1" and "|dx^(j)//d tau|≪1)],[,=-Gamma_(i 00)quad(" since "g_(mu nu)~~eta_(mu nu))],[,{:=(1)/(2)h_(00,i)-h_(0i,0)quad" (equation for "Gamma_(alpha beta gamma)" in terms of "g_(alpha beta,gamma))],[,=(1)/(2)h_(00,i)quad([" all velocities small compared to "c" implies time "],[" derivatives small compared to space derivatives "],[-" i.e., "h_(alpha beta,0)∼vh_(alpha beta,i)]).]:}\begin{array}{rlr} \frac{d^{2} x^{i}}{d t^{2}} & =\frac{d^{2} x^{i}}{d \tau^{2}} & \left(\text { since } d t / d \tau \approx 1 \text { when }\left|h_{\mu \nu}\right| \ll 1 \text { and }\left|v^{j}\right| \ll 1\right) \\ & =-\Gamma^{i}{ }_{\alpha \beta} \frac{d x^{\alpha}}{d \tau} \frac{d x^{\beta}}{d \tau} & \quad \text { (geodesic equation) } \\ & \left.=-\Gamma^{i}{ }_{00} \quad \text { (since } d t / d \tau \approx 1 \text { and }\left|d x^{j} / d \tau\right| \ll 1\right) \\ & =-\Gamma_{i 00} \quad\left(\text { since } g_{\mu \nu} \approx \eta_{\mu \nu}\right) \\ & \left.=\frac{1}{2} h_{00, i}-h_{0 i, 0} \quad \text { (equation for } \Gamma_{\alpha \beta \gamma} \text { in terms of } g_{\alpha \beta, \gamma}\right) \\ & =\frac{1}{2} h_{00, i} \quad\left(\begin{array}{l} \text { all velocities small compared to } c \text { implies time } \\ \text { derivatives small compared to space derivatives } \\ - \text { i.e., } h_{\alpha \beta, 0} \sim v h_{\alpha \beta, i} \end{array}\right) . \end{array}d2xidt2=d2xidτ2( since dt/dτ1 when |hμν|1 and |vj|1)=Γiαβdxαdτdxβdτ (geodesic equation) =Γi00 (since dt/dτ1 and |dxj/dτ|1)=Γi00( since gμνημν)=12h00,ih0i,0 (equation for Γαβγ in terms of gαβ,γ)=12h00,i( all velocities small compared to c implies time  derivatives small compared to space derivatives  i.e., hαβ,0vhαβ,i).
These geodesic world lines do, indeed, reduce to those of Newtonian theory [equation (17.16)] if one makes the identification
(17.18) Γ i 00 = 1 2 h 00 , i = Φ , i . (17.18) Γ i 00 = 1 2 h 00 , i = Φ , i . {:(17.18)Gamma^(i)_(00)=-(1)/(2)h_(00,i)=Phi_(,i).:}\begin{equation*} \Gamma^{i}{ }_{00}=-\frac{1}{2} h_{00, i}=\Phi_{, i} . \tag{17.18} \end{equation*}(17.18)Γi00=12h00,i=Φ,i.
Together with the boundary conditions Φ ( r = ) = 0 Φ ( r = ) = 0 Phi(r=oo)=0\Phi(r=\infty)=0Φ(r=)=0 and h μ ν ( r = ) = 0 h μ ν ( r = ) = 0 h_(mu nu)(r=oo)=0h_{\mu \nu}(r=\infty)=0hμν(r=)=0 (coordinates Lorentz far from the source), this identification implies h 00 = 2 Φ h 00 = 2 Φ h_(00)=-2Phih_{00}=-2 \Phih00=2Φ; i.e.,
g 00 = 1 2 Φ g 00 = 1 2 Φ g_(00)=-1-2Phig_{00}=-1-2 \Phig00=12Φ for nearly Newtonian systems in Newtonian coordinates. (17.19)
Note that the correspondence tells one the form of h 00 h 00 h_(00)h_{00}h00 for nearly Newtonian systems, but not the forms of the other components of the metric perturbation. In fact, the other h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν could perfectly well be of the same order of magnitude as h 00 Φ h 00 Φ h_(00)∼Phih_{00} \sim \Phih00Φ, without influencing the world lines of slowly moving particles, because they always enter the geodesic equation multiplied by the small numbers v v vvv or v 2 v 2 v^(2)v^{2}v2, or differentiated by t t ttt rather than by x i x i x^(i)x^{i}xi. The forms of the other h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν and their small corrections to the Newtonian motion will be explored in Chapters 18, 39, and 40.
The relation g 00 = 1 2 Φ g 00 = 1 2 Φ g_(00)=-1-2Phig_{00}=-1-2 \Phig00=12Φ is the mathematical embodiment of the correspondence between general relativity theory and Newtonian theory for passive aspects of gravity. Together with the "validity conditions" (17.15, 17.17), it is a foundation from which one can derive all other aspects of the correspondence for "passive gravity," including the relation
(17.20) R i 0 j 0 = 2 Φ / x i x j (17.20) R i 0 j 0 = 2 Φ / x i x j {:(17.20)R^(i)_(0j0)=del^(2)Phi//delx^(i)delx^(j):}\begin{equation*} R^{i}{ }_{0 j 0}=\partial^{2} \Phi / \partial x^{i} \partial x^{j} \tag{17.20} \end{equation*}(17.20)Ri0j0=2Φ/xixj
(exercise 17.6). Alternatively, all other aspects of this correspondence can be derived by direct comparison of Newton's predictions with Einstein's. For example, to derive equation (17.20), examine the relative acceleration of two test particles, one at x i + ξ i x i + ξ i x^(i)+xi^(i)x^{i}+\xi^{i}xi+ξi and the other at x i x i x^(i)x^{i}xi. According to Newton
d 2 ξ i d t 2 = d 2 ( x i + ξ i ) d t 2 d 2 x i d t 2 = Φ x i | at x j + ξ + Φ x i | at x = 2 Φ x i x j ξ j . d 2 ξ i d t 2 = d 2 x i + ξ i d t 2 d 2 x i d t 2 = Φ x i at  x j + ξ + Φ x i at  x = 2 Φ x i x j ξ j . {:[(d^(2)xi^(i))/(dt^(2))=(d^(2)(x^(i)+xi^(i)))/(dt^(2))-(d^(2)x^(i))/(dt^(2))],[=-(del Phi)/(delx^(i))|_("at "x^(j)+xi^('))+(del Phi)/(delx^(i))|_("at "x^('))=(-del^(2)Phi)/(delx^(i)delx^(j))xi^(j).]:}\begin{aligned} \frac{d^{2} \xi^{i}}{d t^{2}} & =\frac{d^{2}\left(x^{i}+\xi^{i}\right)}{d t^{2}}-\frac{d^{2} x^{i}}{d t^{2}} \\ & =-\left.\frac{\partial \boldsymbol{\Phi}}{\partial x^{i}}\right|_{\text {at } x^{j}+\xi^{\prime}}+\left.\frac{\partial \boldsymbol{\Phi}}{\partial x^{i}}\right|_{\text {at } x^{\prime}}=\frac{-\partial^{2} \boldsymbol{\Phi}}{\partial x^{i} \partial x^{j}} \xi^{j} . \end{aligned}d2ξidt2=d2(xi+ξi)dt2d2xidt2=Φxi|at xj+ξ+Φxi|at x=2Φxixjξj.
For comparison, Einstein predicts (equation of geodesic deviation)
D 2 ξ i d τ 2 = d 2 ξ i d t 2 = R 0 j 0 i ξ j . [ by conditions (17.15) and (17.17)] D 2 ξ i d τ 2 = d 2 ξ i d t 2 = R 0 j 0 i ξ j . [  by conditions (17.15) and (17.17)]  {:[(D^(2)xi^(i))/(dtau^(2))=(d^(2)xi^(i))/(dt^(2))=-R_(0j0)^(i)xi^(j).],[[" by conditions (17.15) and (17.17)] "]:}\begin{aligned} \frac{D^{2} \xi^{i}}{d \tau^{2}} & =\frac{d^{2} \xi^{i}}{d t^{2}}=-R_{0 j 0}^{i} \xi^{j} . \\ & {[\text { by conditions (17.15) and (17.17)] }} \end{aligned}D2ξidτ2=d2ξidt2=R0j0iξj.[ by conditions (17.15) and (17.17)] 
Direct comparison gives relation (17.20).
Turn now from correspondence for passive aspects of gravity to correspondence for active aspects. According to Einstein, mass generates gravity (spacetime curvature) by the geometrodynamic law G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT. Apply this law to a nearly Newtonian system, and by the chain of reasoning that preceeds equation (17.8) derive the relation
(17.21) R 00 = 4 π ρ . (17.21) R 00 = 4 π ρ . {:(17.21)R_(00)=4pi rho.:}\begin{equation*} R_{00}=4 \pi \rho . \tag{17.21} \end{equation*}(17.21)R00=4πρ.
Einstein gravity reduces to Newton gravity only if, in Newtonian coordinates, g 00 = 1 2 Φ g 00 = 1 2 Φ g_(00)=-1-2Phig_{00}=-1-2 \Phig00=12Φ
The correspondence between Einstein theory and Newton theory for all "passive" aspects of gravity
The Newtonian limit of the Einstein field equation is 2 Φ = 4 π ρ 2 Φ = 4 π ρ grad^(2)Phi=4pi rho\nabla^{2} \Phi=4 \pi \rho2Φ=4πρ
Combine with the contraction of (17.20),
R 00 = R 0 i 0 i + R 0 000 = 2 Φ / x i x i = 2 Φ R 00 = R 0 i 0 i + R 0 000 = 2 Φ / x i x i = 2 Φ R_(00)=R_(0i0)^(i)+R_(uarr)^(0)_(000)=del^(2)Phi//delx^(i)delx^(i)=grad^(2)PhiR_{00}=R_{0 i 0}^{i}+R_{\uparrow}^{0}{ }_{000}=\partial^{2} \Phi / \partial x^{i} \partial x^{i}=\nabla^{2} \PhiR00=R0i0i+R0000=2Φ/xixi=2Φ
and thereby obtain Newton's equation for the generation of gravity by mass
(17.22) 2 Φ = 4 π ρ (17.22) 2 Φ = 4 π ρ {:(17.22)grad^(2)Phi=4pi rho:}\begin{equation*} \nabla^{2} \Phi=4 \pi \rho \tag{17.22} \end{equation*}(17.22)2Φ=4πρ
Thus, Einstein's field equation reduces to Newton's field equation in the Newtonian limit.
The correspondence between Newton and Einstein, although clear and straightforward as outlined above, is even more clear and straightforward when Newton's theory of gravity is rewritten in Einstein's language of curved spacetime (Chapter 12; exercise 17.7).

EXERCISES

Exercise 17.6. RAMIFICATIONS OF CORRESPONDENCE FOR GRAVITY IN A PASSIVE ROLE

From the correspondence relation g 00 = 1 2 Φ g 00 = 1 2 Φ g_(00)=-1-2Phig_{00}=-1-2 \Phig00=12Φ, and from conditions (17.15) and (17.17) for Newtonian physics, derive the correspondence relations
Γ i 00 = Φ / x i , R 0 j 0 i = 2 Φ / x i x j Γ i 00 = Φ / x i , R 0 j 0 i = 2 Φ / x i x j Gamma^(i)_(00)=del Phi//delx^(i),quadR_(0j0)^(i)=del^(2)Phi//delx^(i)delx^(j)\Gamma^{i}{ }_{00}=\partial \Phi / \partial x^{i}, \quad R_{0 j 0}^{i}=\partial^{2} \Phi / \partial x^{i} \partial x^{j}Γi00=Φ/xi,R0j0i=2Φ/xixj
Exercise 17.7. CORRESPONDENCE IN THE LANGUAGE OF CURVED SPACETIME [Track 2]
Exhibit the correspondence between the Einstein theory and Cartan's curved-spacetime formulation of Newtonian theory (Chapter 12).
There are many ways (Box 17.2) to derive the Einstein field equation

§17.5. AXIOMATIZE EINSTEIN'S THEORY?

Find the most compact and reasonable axiomatic structure one can for general relativity? Then from the axioms derive Einstein's field equation,
G = 8 π T ? G = 8 π T ? G=8pi T?\boldsymbol{G}=8 \pi \boldsymbol{T} ?G=8πT?
That approach would follow tradition. However, it may be out of date today. More than half a century has gone by since November 25, 1915. For all that time the equation has stood unchanged, if one ignores Einstein's temporary "aberration" of adding the cosmological constant. In contrast the derivations have evolved and become more numerous and more varied. In the beginning axioms told what equation is acceptable. By now the equation tells what axioms are acceptable. Box 17.2 sketches a variety of sets of axioms, and the resulting derivations of Einstein's equation.

Box 17.2 SIX ROUTES TO EINSTEIN'S GEOMETRODYNAMIC LAW OF THE EQUALITY OF CURVATURE AND ENERGY DENSITY ('EINSTEIN'S FIELD EQUATION'")

[Recommended to the attention of Track-1 readers are only route 1 (automatic conservation of the source, plus correspondence with Newtonian theory) and route 2 (Hilbert's variational principle); and even Track-2 readers are advised to finish the rest of this chapter before they study route 3 (physics on a spacelike slice), route 4 (going from superspace to Einstein's equation), route 5 (field of spin 2 in an "unobservable flat spacetime" background), and route 6 (gravitation as an elasticity of space that arises from particle physics).]
  1. Model geometrodynamics after electrodynamics and treat "automatic conservation of the source" and correspondence with the Newtonian theory of gravity as the central considerations.
    a. Particle responds in electrodynamics to field; in general relativity, to geometry.
    b. The potential for the electromagnetic field is the 4 -vector A A A\boldsymbol{A}A (components A μ A μ A_(mu)A_{\mu}Aμ ). The potential for the geometry is the metric tensor g g g\boldsymbol{g}g (components g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν ).
    c. The electromagnetic potential satisfies a wave equation with source term (4-current) on the right,
(1) ( A ν x μ A μ x ν ) ; p = 4 π j μ , (1) A ν x μ A μ x ν ; p = 4 π j μ , {:(1)((delA_(nu))/(delx^(mu))-(delA_(mu))/(delx^(nu)))^(;p)=4pij_(mu)",":}\begin{equation*} \left(\frac{\partial A_{\nu}}{\partial x^{\mu}}-\frac{\partial A_{\mu}}{\partial x^{\nu}}\right)^{; p}=4 \pi j_{\mu}, \tag{1} \end{equation*}(1)(AνxμAμxν);p=4πjμ,
so constructed that conservation of the source, j μ μ = 0 j μ μ = 0 j_(mu)^('mu)=0j_{\mu}{ }^{\prime \mu}=0jμμ=0, is automatic (consequence of an identity fulfilled by the lefthand side). By analogy, the geometrodynamic potential must also satisfy a wave equation with source term (stressenergy tensor) on the right,
(2) G μ ν = 8 π T μ ν , (2) G μ ν = 8 π T μ ν , {:(2)G_(mu nu)=8piT_(mu nu)",":}\begin{equation*} G_{\mu \nu}=8 \pi T_{\mu \nu}, \tag{2} \end{equation*}(2)Gμν=8πTμν,
so constructed that conservation of the source, T μ ν ; ν = 0 T μ ν ; ν = 0 T_(mu nu)^(;nu)=0T_{\mu \nu}{ }^{; \nu}=0Tμν;ν=0 (Chapter 16) is "automatic." This conservation is automatic here because the lefthand side of the equation is a tensor (the Einstein tensor; see Box 8.6 or Chapter 15), built from the metric components and their second derivatives, that fulfills the identity G μ ν ; ν 0 G μ ν ; ν 0 G_(mu nu)^(;nu)-=0G_{\mu \nu}{ }^{; \nu} \equiv 0Gμν;ν0.
d. No other tensor which (1) is linear in the second derivatives of the metric components, (2) is free of higher derivatives, and (3) vanishes in flat spacetime, satisfies such an identity.
e. The constant of proportionality ( 8 π ) ( 8 π ) (8pi)(8 \pi)(8π) is fixed by the choice of units [here geometric; see Box 1.8 ] and by the requirement ("correspondence with Newtonian theory") that a test particle shall oscillate back and forth through a collection of matter of density ρ ρ rho\rhoρ, or revolve in circular orbit around that collection of matter, at a circular frequency given by ω 2 = ( 4 π / 3 ) ρ ω 2 = ( 4 π / 3 ) ρ omega^(2)=(4pi//3)rho\omega^{2}=(4 \pi / 3) \rhoω2=(4π/3)ρ (Figure

Box 17.2 (continued)

1.12). The foregoing oversimplifies, and omits Einstein's temporary false turns, but otherwise summarizes the reasoning he pursued in arriving at his field equation. This reasoning is spelled out in more detail in the text of Chapter 17.
2. Take variational principle as central.
a. Construct out of the metric components the only scalar that exists that (1) is linear in the second derivatives of the metric tensor, (2) contains no higher derivatives, and (3) vanishes in flat spacetime: namely, the Riemann scalar curvature invariant, R R RRR.
b. Construct the invariant integral,
(3) I = 1 16 π Ω R ( g ) 1 / 2 d 4 x (3) I = 1 16 π Ω R ( g ) 1 / 2 d 4 x {:(3)I=(1)/(16 pi)int_(Omega)R(-g)^(1//2)d^(4)x:}\begin{equation*} I=\frac{1}{16 \pi} \int_{\Omega} R(-g)^{1 / 2} d^{4} x \tag{3} \end{equation*}(3)I=116πΩR(g)1/2d4x
c. Make small variations, δ g μ ν δ g μ ν deltag^(mu nu)\delta g^{\mu \nu}δgμν, in the metric coefficients g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν in the interior of the four-dimensional region Ω Ω Omega\OmegaΩ, and find that this integral changes by the amount
(4) δ I = 1 16 π Ω G μ ν δ g μ ν ( g ) 1 / 2 d 4 x (4) δ I = 1 16 π Ω G μ ν δ g μ ν ( g ) 1 / 2 d 4 x {:(4)delta I=(1)/(16 pi)int_(Omega)G_(mu nu)deltag^(mu nu)(-g)^(1//2)d^(4)x:}\begin{equation*} \delta I=\frac{1}{16 \pi} \int_{\Omega} G_{\mu \nu} \delta g^{\mu \nu}(-g)^{1 / 2} d^{4} x \tag{4} \end{equation*}(4)δI=116πΩGμνδgμν(g)1/2d4x
d. Demand that I I III should be an extremum with respect to the choice of geometry in the region interior to Ω ( δ I = 0 Ω δ I = 0 Omega(delta I=0:}\Omega\left(\delta I=0\right.Ω(δI=0 for arbitrary δ g μ ν δ g μ ν deltag^(mu nu)\delta g^{\mu \nu}δgμν; "principle of extremal action").
e. Thus arrive at the Einstein field equation for empty space,
(5) G μ ν = 0 . (5) G μ ν = 0 . {:(5)G_(mu nu)=0.:}\begin{equation*} G_{\mu \nu}=0 . \tag{5} \end{equation*}(5)Gμν=0.
f. The continuation of the reasoning leads to the identity
G μ ν ; ν = 0 . G μ ν ; ν = 0 . G_(mu nu);nu=0.G_{\mu \nu} ; \nu=0 .Gμν;ν=0.
Chapter 21, on the variational principle, gives more detail and takes up the additional term that appears on the righthand side of ( 5 ) ( 5 ) (5)(5)(5) when matter or fields or both are present.
g. This approach goes back to David Hilbert (1915). No route to the field equations is quicker. Moreover, it connects immediately (see the following section here, 2 2 2^(')2^{\prime}2 ) with the quantum principle of the "democracy of all histories" [Feynman (1942); Feynman and Hibbs (1965)]. The variational principle is spelled out in more detail in Chapter 21.
2 2 2^(')2^{\prime}2. An aside on the meaning of the classical action integral for the real world of quantum physics.
a. A "history of geometry," H H HHH, is a spacetime, that is to say, a four-dimensional manifold with four-dimensional -+++ Riemann metric that (1) reduces on one spacelike hypersurface ("hypersurface of simultaneity") to a specified "initial value 3 -geometry," A A AAA, with positive definite metric and (2) reduces on
another spacelike hypersurface to a specified "final value 3 -geometry," B B BBB, also with positive definite metric.
b. The classical variational principle of Hilbert, as reformulated by Arnowitt, Deser, and Misner, provides a prescription for the dynamical path length, I H I H I_(H)I_{H}IH, of any conceivable history H H HHH, classically allowed or not, that connects A A AAA and B B BBB (see Chapter 21 for a fuller statement for what can and must be specified on the initial hypersurface of simultaneity, and on the final one, and for alternative choices of the integrand in the action principle).
c. Classical physics says that a history H H HHH is allowed only if it extremizes the dynamic path length I I III as compared to all nearby histories. Quantum physics says that all histories occur with equal probability amplitude, in the following sense. The probability amplitude for "the dynamic geometry of space to transit from A A AAA to B B BBB " by way of the history H H HHH with action integral I H I H I_(H)I_{H}IH, and by way
H H HHH, is given by the expression
(6) ( probability amplitude to transit from A to B by way of history H and histories lying within the range D H about H ) exp ( i I H / ) N Q H . (6)  probability amplitude   to transit from  A  to  B  by way of history  H  and histories lying   within the range  D H  about  H exp i I H / N Q H . {:(6)([" probability amplitude "],[" to transit from "A" to "],[B" by way of history "H],[" and histories lying "],[" within the range "DH],[" about "H])∼exp(iI_(H)//ℏ)NQH.:}\left(\begin{array}{l} \text { probability amplitude } \tag{6}\\ \text { to transit from } A \text { to } \\ B \text { by way of history } H \\ \text { and histories lying } \\ \text { within the range } \mathscr{D} H \\ \text { about } H \end{array}\right) \sim \exp \left(i I_{H} / \hbar\right) N \mathscr{Q} H .(6)( probability amplitude  to transit from A to B by way of history H and histories lying  within the range DH about H)exp(iIH/)NQH.
Here the normalization factor, N N NNN, is the same for all conceivable histories H H HHH, allowed or not, that lead from A A AAA to B B BBB ("principle of democracy of histories"). The quantum of angular momentum, = h / 2 π = h / 2 π ℏ=h//2pi\hbar=h / 2 \pi=h/2π, expressed in geometric units, has the value
(7) = conv G / c 3 = ( L ) 2 , (7) = conv  G / c 3 = L 2 , {:(7)ℏ=ℏ_("conv ")G//c^(3)=(L^(**))^(2)",":}\begin{equation*} \hbar=\hbar_{\text {conv }} G / c^{3}=\left(L^{*}\right)^{2}, \tag{7} \end{equation*}(7)=conv G/c3=(L)2,
where L L L^(**)L^{*}L is the Planck length, L = 1.6 × 10 33 cm L = 1.6 × 10 33 cm L^(**)=1.6 xx10^(-33)cmL^{*}=1.6 \times 10^{-33} \mathrm{~cm}L=1.6×1033 cm.
d. The classically allowed history receives "preference without preference." That history, and histories H H HHH that differ from it so little that δ I = I H I class δ I = I H I class  delta I=I_(H)-I_("class ")\delta I=I_{H}-I_{\text {class }}δI=IHIclass  is only of the order \hbar and less, give contributions to the probability amplitude that interfere constructively. In contrast, destructive interference effectively wipes out the contribution (to the probability amplitude for a transition) that comes from histories that differ more from the classically allowed history. Thus there are quantum fluctuations in the geometry, but they are fluctuations of limited magnitude. The smallness of \hbar ensures that the scale of these fluctuations is unnoticeable at everyday distances (see the further discussion in Chapters 43 and 44). In this sense classical geometrodynamics is a good approximation to the geometrodynamics of the real world of quantum physics.
3. "Physics on a spacelike slice or hypersurface of simultaneity," again with electromagnetism as the model.
a. Say over and over "lines of magnetic force never end" and come out with half of Maxwell's equations. Say over and over "lines of electric force end
Box 17.2 (continued)
only on charge" and arrive at the other half of Maxwell's equations. Similarly, say over and over
(8) ( intrinsic curvature scalar ) + ( extrinsic curvature scalar ) = 16 π ( local density of mass- energy ) (8)  intrinsic   curvature   scalar  +  extrinsic   curvature   scalar  = 16 π  local density   of mass-   energy  {:(8)([" intrinsic "],[" curvature "],[" scalar "])+([" extrinsic "],[" curvature "],[" scalar "])=16 pi([" local density "],[" of mass- "],[" energy "]):}\left(\begin{array}{l} \text { intrinsic } \tag{8}\\ \text { curvature } \\ \text { scalar } \end{array}\right)+\left(\begin{array}{l} \text { extrinsic } \\ \text { curvature } \\ \text { scalar } \end{array}\right)=16 \pi\left(\begin{array}{l} \text { local density } \\ \text { of mass- } \\ \text { energy } \end{array}\right)(8)( intrinsic  curvature  scalar )+( extrinsic  curvature  scalar )=16π( local density  of mass-  energy )
and end up with all ten components of Einstein's equation. To "say over and over" is an abbreviation for demanding that the stated principles hold on every spacelike slice through every event of spacetime.
b. Spell out explicitly this "spacelike-slice formulation" of the equations of Maxwell and Einstein. Consider an arbitrary point of spacetime, P P P\mathscr{P}P ("event"), and an arbitrary "simultaneity" S S SSS through P P P\mathscr{P}P (hypersurface of simultaneity; spacelike slice through spacetime). Magnetic lines of force run about throughout S S S\mathcal{S}S, but nowhere is even a single one of them permitted to end. Recall ( § 3.4 § 3.4 §3.4\S 3.4§3.4 ) that the demand "lines of magnetic force never end," when imposed on all reference frames at P P P\mathscr{P}P (for all choices of the "simultaneity" S S S\mathcal{S}S ), guarantees not only B = 0 B = 0 grad*B=0\boldsymbol{\nabla} \cdot \boldsymbol{B}=0B=0, but also × E + B / t = 0 × E + B / t = 0 grad xx E+del B//del t=0\boldsymbol{\nabla} \times \boldsymbol{E}+\partial \boldsymbol{B} / \partial t=0×E+B/t=0. Similarly (§3.4) the demand that "electric lines of force never end except on electric charge," E = 4 π J 0 E = 4 π J 0 grad*E=4piJ^(0)\boldsymbol{\nabla} \cdot \boldsymbol{E}=4 \pi J^{0}E=4πJ0, when imposed on all "simultaneities" through P P P\mathscr{P}P, guarantees the remaining Maxwell equation × B = E / t + 4 π J × B = E / t + 4 π J grad xx B=del E//del t+4pi J\boldsymbol{\nabla} \times \boldsymbol{B}=\partial \boldsymbol{E} / \partial t+4 \pi \boldsymbol{J}×B=E/t+4πJ.
c. Each simultaneity S S SSS through P P P\mathscr{P}P has its own slope and curvature. The possibility of different slopes (different local Lorentz frames at P P P\mathscr{P}P ) is essential for deriving all of Maxwell's equations from the requirements of conservation of flux. Relevant though the slope thus is, the curvature of the hypersurface S S SSS never matters for the analysis of electromagnetism. It does matter, however, for any analysis of gravitation modeled on the foregoing treatment of electromagnetism.

"Simultaneity" S S SSS (spacelike hypersurface or "slice through spacetime") that cuts through event P P P\mathscr{P}P. The "simultaneity" may be considered to be defined by a set of "observers" a,b,c, . . . . Their world lines cross the simultaneity orthogonally, and their clocks all read the same proper time at the instant of crossing. Another simultaneity through P P P\mathscr{P}P may have at P P P\mathscr{P}P a different curvature or a different slope or both; and it is defined by a different band of observers, with other wrist watches.
d. "Mass-energy curves space" is the central principle of gravitation. To spell out this principle requires one to examine in succession the terms "space" and "curvature of space" and "density of mass-energy in a given region of space." "Space" means spacelike hypersurface; or, more specifically, a hypersurface of simultaneity S S SSS that includes the point P P P\mathscr{P}P where the physics is under examination.
e. Denote by u u u\boldsymbol{u}u the 4 -vector normal to S S S\mathcal{S}S at P P P\mathscr{P}P. Then the density of mass-energy in the spacelike hypersurface S S SSS at P P P\mathscr{P}P is
(9) ρ = u α T α β u β , (9) ρ = u α T α β u β , {:(9)rho=u^(alpha)T_(alpha beta)u^(beta)",":}\begin{equation*} \rho=u^{\alpha} T_{\alpha \beta} u^{\beta}, \tag{9} \end{equation*}(9)ρ=uαTαβuβ,
in accordance with the definition of the stress-energy tensor given in Chapter 5.
f. This density is a single number, dependent on the inclination of the slice one cuts through spacetime, but independent of how curved one cuts this slice. If it is to be equated to "curvature of space," that curvature must also be independent of how curved one cuts the slice.
g. Conclude that the geometric quantity, "curvature of space," must (1) be a single number (a scalar) that (2) depends on the inclination u u u\boldsymbol{u}u of the cut one makes through spacetime at P P P\mathscr{P}P in constructing the hypersurface S S S\mathcal{S}S, but (3) must be unaffected by how one curves his cut. The demand made here appears paradoxical. One seems to be asking for a measure of curvature that is independent of curvature!
h. A closer look discloses that three distinct ideas come into consideration here. One is the scalar curvature invariant ( 3 ) R ( 3 ) R ^((3))R{ }^{(3)} R(3)R of the 3 -geometry intrinsic to the hypersurface S S SSS at P P P\mathscr{P}P : "intrinsic" in the sense that it is defined by, and depends exclusively on, measurements of distance made within the hypersurface. The second is the "extrinsic curvature" of this 3-geometry relative to the 4-geometry of the enveloping spacetime ("how curved one cuts his slice"; see Box 14.1 for more on the distinction between extrinsic and intrinsic curvature). The third is the curvature of the four-dimensional spacetime itself, "normal to u u u\boldsymbol{u}u," in some sense yet to be more closely defined. This is the quantity that is independent of how curved one cuts his slice. It is the quantity that is to be identified, up to a factor that depends on the choice of units, with the density of mass-energy.

Box 17.2 (continued)

i. These three quantities are related in the following way:
a correction term that (a) depends only on the "extrinsic curvature" K α β K α β K_(alpha beta)K_{\alpha \beta}Kαβ (Box 14.1 and Chapter 21) of the hypersurface relative to the four-dimensional geometry in which it is imbedded, and (b) is so calculated (a uniquely determinate calculation) that the sum of this correction term and ( 3 ) R ( 3 ) R ^((3))R{ }^{(3)} R(3)R is independent of "how curved one cuts his slice," and (c) has the precise value
( Tr K ) 2 Tr K 2 ( K α α ) 2 K α β K α β ) ( Tr K ) 2 Tr K 2 K α α 2 K α β K α β {:(Tr K)^(2)-TrK^(2)-=(K_(alpha)^(alpha))^(2)-K_(alpha beta)K^(alpha beta))\left.(\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr} \boldsymbol{K}^{2} \equiv\left(K_{\alpha}{ }^{\alpha}\right)^{2}-K_{\alpha \beta} K^{\alpha \beta}\right)(TrK)2TrK2(Kαα)2KαβKαβ)
( scalar curvature invariant, ( 3 ) R , of the 3-geometry intrinsic to the spacelike hypersurface S , a quantity dependent on "how curved one cuts the slice" ) +  scalar curvature invariant,  ( 3 ) R , of the 3-geometry   intrinsic to the spacelike   hypersurface  S , a quantity   dependent on "how curved   one cuts the slice"  + ([" scalar curvature invariant, "],[^((3))R", of the 3-geometry "],[" intrinsic to the spacelike "],[" hypersurface "S", a quantity "],[" dependent on "how curved "],[" one cuts the slice" "])+\left(\begin{array}{l}\text { scalar curvature invariant, } \\ { }^{(3)} R \text {, of the 3-geometry } \\ \text { intrinsic to the spacelike } \\ \text { hypersurface } S \text {, a quantity } \\ \text { dependent on "how curved } \\ \text { one cuts the slice" }\end{array}\right)+( scalar curvature invariant, (3)R, of the 3-geometry  intrinsic to the spacelike  hypersurface S, a quantity  dependent on "how curved  one cuts the slice" )+
= ( a measure of the curvature of spacetime that depends on the 4-geemetry of the spacetime and on the inclination u of the spacelike slice S cut through spacetime, but is independent, by construc- tion, of "how curved one cuts the slice" ) = ( a scalar quantity that (a) is completely defined by what has just been said and (b) can there- fore be calculated in all completeness by standard differential geometry (details in Chapter 21) ) =  a measure of the curvature   of spacetime that depends   on the 4-geemetry of   the spacetime and on   the inclination  u  of the   spacelike slice  S  cut   through spacetime, but is   independent, by construc-   tion, of "how curved one   cuts the slice"  =  a scalar quantity that   (a) is completely defined   by what has just been   said and (b) can there-   fore be calculated in all   completeness by standard   differential geometry   (details in Chapter 21)  =([" a measure of the curvature "],[" of spacetime that depends "],[" on the 4-geemetry of "],[" the spacetime and on "],[" the inclination "u" of the "],[" spacelike slice "S" cut "],[" through spacetime, but is "],[" independent, by construc- "],[" tion, of "how curved one "],[" cuts the slice" "])=([" a scalar quantity that "],[" (a) is completely defined "],[" by what has just been "],[" said and (b) can there- "],[" fore be calculated in all "],[" completeness by standard "],[" differential geometry "],[" (details in Chapter 21) "])=\left(\begin{array}{l}\text { a measure of the curvature } \\ \text { of spacetime that depends } \\ \text { on the 4-geemetry of } \\ \text { the spacetime and on } \\ \text { the inclination } \boldsymbol{u} \text { of the } \\ \text { spacelike slice } S \text { cut } \\ \text { through spacetime, but is } \\ \text { independent, by construc- } \\ \text { tion, of "how curved one } \\ \text { cuts the slice" }\end{array}\right)=\left(\begin{array}{l}\text { a scalar quantity that } \\ \text { (a) is completely defined } \\ \text { by what has just been } \\ \text { said and (b) can there- } \\ \text { fore be calculated in all } \\ \text { completeness by standard } \\ \text { differential geometry } \\ \text { (details in Chapter 21) }\end{array}\right)=( a measure of the curvature  of spacetime that depends  on the 4-geemetry of  the spacetime and on  the inclination u of the  spacelike slice S cut  through spacetime, but is  independent, by construc-  tion, of "how curved one  cuts the slice" )=( a scalar quantity that  (a) is completely defined  by what has just been  said and (b) can there-  fore be calculated in all  completeness by standard  differential geometry  (details in Chapter 21) )
(10) = ( 2 u α G α β u β , where G α β is the Einstein curvature tensor of equation 8.49 and Box 8.6 ) = 2 ( a quantity interpreted in Track 2, Chapter 15, as the "moment of rotation" asso- ciated with a unit element of 3-volume located at P in the hypersurface orth- ogonal to u ) (10) = 2 u α G α β u β ,  where  G α β  is   the Einstein curvature   tensor of equation  8.49  and Box  8.6 = 2  a quantity interpreted in   Track 2, Chapter 15, as the   "moment of rotation" asso-   ciated with a unit element   of 3-volume located at  P  in the hypersurface orth-   ogonal to  u {:(10)=([2u^(alpha)G_(alpha beta)u^(beta)","" where "G_(alpha beta)" is "],[" the Einstein curvature "],[" tensor of equation "8.49],[" and Box "8.6])=2([" a quantity interpreted in "],[" Track 2, Chapter 15, as the "],[" "moment of rotation" asso- "],[" ciated with a unit element "],[" of 3-volume located at "P],[" in the hypersurface orth- "],[" ogonal to "u]):}=\left(\begin{array}{l}2 u^{\alpha} G_{\alpha \beta} u^{\beta}, \text { where } G_{\alpha \beta} \text { is } \tag{10}\\ \text { the Einstein curvature } \\ \text { tensor of equation } 8.49 \\ \text { and Box } 8.6\end{array}\right)=2\left(\begin{array}{l}\text { a quantity interpreted in } \\ \text { Track 2, Chapter 15, as the } \\ \text { "moment of rotation" asso- } \\ \text { ciated with a unit element } \\ \text { of 3-volume located at } \mathscr{P} \\ \text { in the hypersurface orth- } \\ \text { ogonal to } \boldsymbol{u}\end{array}\right)(10)=(2uαGαβuβ, where Gαβ is  the Einstein curvature  tensor of equation 8.49 and Box 8.6)=2( a quantity interpreted in  Track 2, Chapter 15, as the  "moment of rotation" asso-  ciated with a unit element  of 3-volume located at P in the hypersurface orth-  ogonal to u)
j. Conclude that the central principle, "mass-energy curves space," translates to the formula
(11) ( 3 ) R + ( Tr K ) 2 Tr K 2 = 16 π ρ , (11) ( 3 ) R + ( Tr K ) 2 Tr K 2 = 16 π ρ , {:(11)^((3))R+(Tr K)^(2)-TrK^(2)=16 pi rho",":}\begin{equation*} { }^{(3)} R+(\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr} \boldsymbol{K}^{2}=16 \pi \rho, \tag{11} \end{equation*}(11)(3)R+(TrK)2TrK2=16πρ,
or, in shorthand form,
(12) ( moment of rotation ) = ( intrinsic curvature ) + ( extrinsic curvature ) = ( density of mass-energy ) , (12) (  moment of   rotation  ) = (  intrinsic   curvature  ) + (  extrinsic   curvature  ) = (  density of   mass-energy  ) , {:(12)((" moment of ")/(" rotation "))=((" intrinsic ")/(" curvature "))+((" extrinsic ")/(" curvature "))=((" density of ")/(" mass-energy "))",":}\begin{equation*} \binom{\text { moment of }}{\text { rotation }}=\binom{\text { intrinsic }}{\text { curvature }}+\binom{\text { extrinsic }}{\text { curvature }}=\binom{\text { density of }}{\text { mass-energy }}, \tag{12} \end{equation*}(12)( moment of  rotation )=( intrinsic  curvature )+( extrinsic  curvature )=( density of  mass-energy ),
valid for every spacelike slice through spacetime at any arbitrary point P P P\mathscr{P}P.
k. All of Einstein's geometrodynamics is contained in this statement as truly as all of Maxwell's electrodynamics is contained in the statement that the number of lines of force that end in an element of volume is equal to 4 π 4 π 4pi4 \pi4π times the amount of charge in that element of volume. The factor 16 π 16 π 16 pi16 \pi16π is appropriate for the geometric system of units in use in this book (density ρ ρ rho\rhoρ in cm 2 cm 2 cm^(-2)\mathrm{cm}^{-2}cm2 given by G / c 2 = 0.742 × 10 28 cm / g G / c 2 = 0.742 × 10 28 cm / g G//c^(2)=0.742 xx10^(-28)cm//gG / c^{2}=0.742 \times 10^{-28} \mathrm{~cm} / \mathrm{g}G/c2=0.742×1028 cm/g multiplied by the density ρ conv ρ conv  rho_("conv ")\rho_{\text {conv }}ρconv  expressed in the conventional units of g / cm 3 g / cm 3 g//cm^(3)\mathrm{g} / \mathrm{cm}^{3}g/cm3 ).
  1. Reexpress the principle that "mass-energy curves space" in the form
(13) 2 u α G α β u β = 16 π u α T α β u β . (13) 2 u α G α β u β = 16 π u α T α β u β . {:(13)2u^(alpha)G_(alpha beta)u^(beta)=16 piu^(alpha)T_(alpha beta)u^(beta).:}\begin{equation*} 2 u^{\alpha} G_{\alpha \beta} u^{\beta}=16 \pi u^{\alpha} T_{\alpha \beta} u^{\beta} . \tag{13} \end{equation*}(13)2uαGαβuβ=16πuαTαβuβ.
Demand that this equation should hold for every simultaneity that cuts through P P P\mathscr{P}P, whatever its "inclination" u u u\boldsymbol{u}u.
m . Conclude that the coefficients on the two sides of (13) must agree; thus,
(14) G α β = 8 π T α β , (14) G α β = 8 π T α β , {:(14)G_(alpha beta)=8piT_(alpha beta)",":}\begin{equation*} G_{\alpha \beta}=8 \pi T_{\alpha \beta}, \tag{14} \end{equation*}(14)Gαβ=8πTαβ,
Einstein's equation in the language of components; or, in the language of abstract geometric quantities,
(15) G = 8 π T . (15) G = 8 π T . {:(15)G=8pi T.:}\begin{equation*} \boldsymbol{G}=8 \pi \boldsymbol{T} . \tag{15} \end{equation*}(15)G=8πT.
  1. Going from superspace to Einstein's equation rather than from Einstein's equation to superspace.
    a. A fourth route to Einstein's equation starts with the advanced view of geometrodynamics that is spelled out in Chapter 43. One notes there that the dynamics of geometry unfolds in superspace. Superspace has an infinite number of dimensions. Any one point in superspace describes a complete 3 -geometry, ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) y, with all its bumps and curvatures. The dynamics of geometry leads from point to point in superspace.
    b. Like the dynamics of a particle, the dynamics of geometry lends itself to distinct but equivalent mathematical formulations, associated with the names of Lagrange, of Hamilton, and of Hamilton and Jacobi. Of these the most convenient for the present analysis is the last ("H-J").
    c. In the problem of one particle moving in one dimension under the influence of a potential V ( x ) V ( x ) V(x)V(x)V(x), the H J H J H-J\mathrm{H}-\mathrm{J}HJ equation reads
(16) S t total energy = 1 2 m ( S x ) 2 kinetic energy + V ( x ) . (16) S t  total   energy  = 1 2 m S x 2  kinetic   energy  + V ( x ) . {:(16)ubrace(-(del S)/(del t)ubrace)_({:[" total "],[" energy "]:})=ubrace((1)/(2m)((del S)/(del x))^(2)ubrace)_({:[" kinetic "],[" energy "]:})+V(x).:}\begin{equation*} \underbrace{-\frac{\partial S}{\partial t}}_{\substack{\text { total } \\ \text { energy }}}=\underbrace{\frac{1}{2 m}\left(\frac{\partial S}{\partial x}\right)^{2}}_{\substack{\text { kinetic } \\ \text { energy }}}+V(x) . \tag{16} \end{equation*}(16)St total  energy =12m(Sx)2 kinetic  energy +V(x).
Box 17.2 (continued)
It has the solution
(17) S E ( x , t ) = E t + x [ 2 m ( E V ) ] 1 / 2 d x (17) S E ( x , t ) = E t + x [ 2 m ( E V ) ] 1 / 2 d x {:(17)S_(E)(x","t)=-Et+int^(x)[2m(E-V)]^(1//2)dx:}\begin{equation*} S_{E}(x, t)=-E t+\int^{x}[2 m(E-V)]^{1 / 2} d x \tag{17} \end{equation*}(17)SE(x,t)=Et+x[2m(EV)]1/2dx
Out of this solution one reads the motion by applying the "condition of constructive interference,"
(18) S E ( x , t ) E = 0 (18) S E ( x , t ) E = 0 {:(18)(delS_(E)(x,t))/(del E)=0:}\begin{equation*} \frac{\partial S_{E}(x, t)}{\partial E}=0 \tag{18} \end{equation*}(18)SE(x,t)E=0
(one equation connecting the two quantities x x xxx and t t ttt; for more on the condition of constructive interference and the H-J method in general, see Boxes 25.3 and 25.4).
d. In the corresponding equation for the dynamics of geometry, one deals with a function S = S ( ( 3 ) ξ ) S = S ( 3 ) ξ S=S(^((3))xi)S=S\left({ }^{(3)} \xi\right)S=S((3)ξ) of the 3-geometry. It depends on the 3-geometry itself, and not on the vagaries of one's choice of coordinates or on the corresponding vagaries in the metric coefficients of the 3-geometry,
(19) d s 2 = ( 3 ) g m n d x m d x n (19) d s 2 = ( 3 ) g m n d x m d x n {:(19)ds^(2)=^((3))g_(mn)dx^(m)dx^(n):}\begin{equation*} d s^{2}={ }^{(3)} g_{m n} d x^{m} d x^{n} \tag{19} \end{equation*}(19)ds2=(3)gmndxmdxn
( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) to indicate 3-geometry omitted hereafter for simplicity). This function obeys the H-J equation [the analog of (16)]
(20) ( 16 π ) 2 1 2 g ( g i m g j n + g i n g j m g i j g m n ) δ S δ g i j δ S δ g m n + ( 3 ) R = 16 π ρ . (20) ( 16 π ) 2 1 2 g g i m g j n + g i n g j m g i j g m n δ S δ g i j δ S δ g m n + ( 3 ) R = 16 π ρ . {:(20)-(16 pi)^(2)(1)/(2g)(g_(im)g_(jn)+g_(in)g_(jm)-g_(ij)g_(mn))(delta S)/(deltag_(ij))(delta S)/(deltag_(mn))+^((3))R=16 pi rho.:}\begin{equation*} -(16 \pi)^{2} \frac{1}{2 g}\left(g_{i m} g_{j n}+g_{i n} g_{j m}-g_{i j} g_{m n}\right) \frac{\delta S}{\delta g_{i j}} \frac{\delta S}{\delta g_{m n}}+{ }^{(3)} R=16 \pi \rho . \tag{20} \end{equation*}(20)(16π)212g(gimgjn+gingjmgijgmn)δSδgijδSδgmn+(3)R=16πρ.
e. Out of this equation for the dynamics of geometry in superspace one can deduce the Einstein field equation by reasoning similar to that employed in going from (17) to (18) (Gerlach 1969).
f. It would appear that one must break new ground, and establish new foundations, if one is to find out how to regard the "Einstein-Hamilton-Jacobi equation" (20) as more basic than the Einstein field equation that one derives from it. [Since done, by Hojman, Kuchař, and Teitelboim (1973 preprint).]
5. Einstein's geometrodynamics viewed as the standard field theory for a field of spin 2 in an "unobservable flat spacetime" background.
a. This approach to Einstein's field equation has a long history, references to which will be found in § 7.1 § 7.1 §7.1\S 7.1§7.1 and § 18.1 § 18.1 §18.1\S 18.1§18.1. (Further discussion of this approach will be found in those two sections and in Box 7.1, exercise 7.3, and Box 18.1).
b. The following summary is quoted from Deser Deser Deser\operatorname{Deser}Deser (1970): "We wish to give a simple physical derivation of the nonlinearity ..., using a now familiar argument . . . leading from the linear, massless, spin-2 field to the full Einstein equations . . . .
c. "The Einstein equations may be derived nongeometrically by noting that the free, massless, spin-2 field equations,
R L μ ν ( ϕ ) 1 2 R L α α ( ϕ ) η μ ν G L μ ν ( ϕ ) [ ( η μ α η ν β η μ ν η α β ) (21) + η μ ν α β + η α β μ ν η μ α ν β η ν β μ α ] ϕ α β = 0 , R L μ ν ( ϕ ) 1 2 R L α α ( ϕ ) η μ ν G L μ ν ( ϕ ) η μ α η ν β η μ ν η α β (21) + η μ ν α β + η α β μ ν η μ α ν β η ν β μ α ϕ α β = 0 , {:[R^(L)_(mu nu)(phi)-(1)/(2)R^(L)_(alpha alpha)(phi)eta_(mu nu)-=G^(L)_(mu nu)(phi)-=[(eta_(mu alpha)eta_(nu beta)-eta_(mu nu)eta_(alpha beta))◻:}],[(21){:+eta_(mu nu)del_(alpha)del_(beta)+eta_(alpha beta)del_(mu)del_(nu)-eta_(mu alpha)del_(nu)del_(beta)-eta_(nu beta)del_(mu)del_(alpha)]phi_(alpha beta)=0","]:}\begin{align*} & R^{L}{ }_{\mu \nu}(\phi)-\frac{1}{2} R^{L}{ }_{\alpha \alpha}(\phi) \eta_{\mu \nu} \equiv G^{L}{ }_{\mu \nu}(\phi) \equiv\left[\left(\eta_{\mu \alpha} \eta_{\nu \beta}-\eta_{\mu \nu} \eta_{\alpha \beta}\right) \square\right. \\ & \left.+\eta_{\mu \nu} \partial_{\alpha} \partial_{\beta}+\eta_{\alpha \beta} \partial_{\mu} \partial_{\nu}-\eta_{\mu \alpha} \partial_{\nu} \partial_{\beta}-\eta_{\nu \beta} \partial_{\mu} \partial_{\alpha}\right] \phi_{\alpha \beta}=0, \tag{21} \end{align*}RLμν(ϕ)12RLαα(ϕ)ημνGLμν(ϕ)[(ημαηνβημνηαβ)(21)+ημναβ+ηαβμνημανβηνβμα]ϕαβ=0,
whose source is the matter stress-tensor T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν, must actually be coupled to the total stress-tensor, including that of the ϕ ϕ phi\phiϕ-field itself. That is, while the free-field equations (21) are of course quite consistent as they stand, [they are not] when there is a dynamic system's T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν as a source. For then the left side, which is identically divergenceless, is inconsistent with the right, since the coupling implies that T μ ν , ν T μ ν , ν T^(mu nu)_(,nu)T^{\mu \nu}{ }_{, \nu}Tμν,ν, as computed from the matter equations of motion, is no longer conserved.
d. "To remedy this [violation of the principle of conservation of momentum and energy], the stress tensor ( 2 ) θ μ ν ( 2 ) θ μ ν ^((2))theta_(mu nu){ }^{(2)} \theta_{\mu \nu}(2)θμν arising from the quadratic Lagrangian ( 2 ) L ( 2 ) L ^((2))L{ }^{(2)} L(2)L responsible for equation (21) is then inserted on the right.
e. "But the Lagrangian ( 3 ) L ( 3 ) L ^((3))L{ }^{(3)} L(3)L leading to these modified equations is then cubic, and itself contributes a cubic ( 3 ) θ μ ν ( 3 ) θ μ ν ^((3))theta_(mu nu){ }^{(3)} \theta_{\mu \nu}(3)θμν.
f. "This series continues indefinitely, and sums (if properly derived!) to the full nonlinear Einstein equations, G μ ν G μ ν G_(mu nu)G_{\mu \nu}Gμν ([calculated from] η α β + ϕ α β ) = κ T μ ν η α β + ϕ α β = κ T μ ν {:eta_(alpha beta)+phi_(alpha beta))=-kappaT_(mu nu)\left.\eta_{\alpha \beta}+\phi_{\alpha \beta}\right)=-\kappa T_{\mu \nu}ηαβ+ϕαβ)=κTμν [ + 8 π T μ ν + 8 π T μ ν +8piT_(mu nu)+8 \pi T_{\mu \nu}+8πTμν in the geometric units and sign conventions of this book], which are an infinite series in the deviation ϕ μ ν ϕ μ ν phi_(mu nu)\phi_{\mu \nu}ϕμν of the metric g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν from its Minkowskian value η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν.
g. Once the iteration is begun (whether or not a T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν is actually present), it must be continued to all orders, since conservation only holds for the full series
n = 2 ( n ) θ μ ν n = 2 ( n ) θ μ ν sum_(n=2)^(oo)^((n))theta_(mu nu)\sum_{n=2}^{\infty}{ }^{(n)} \theta_{\mu \nu}n=2(n)θμν. Thus, the theory is either left in its (physically irrelevant) free linear form (21), or it must be an infinite series."
h. For details, see Deser (1970); the paper goes on (1) to take advantage of a well-chosen formalism (2) to rearrange the calculation, and thus (3) to "derive the full Einstein equations, on the basis of the same self-coupling requirement, but with the advantages that the full theory emerges in closed form with just one added (cubic) term, rather than as an infinite series."
i. Deser summarizes the analysis at the end thus: "Consistency has therefore led us to universal coupling, which implies the equivalence principle. It is at this point that the geometric interpretation of general relativity arises, since all matter now moves in an effective Riemann space of metric g μ ν η μ ν + h μ ν g μ ν η μ ν + h μ ν g^(mu nu)-=eta^(mu nu)+h^(mu nu)\mathfrak{g}^{\mu \nu} \equiv \eta^{\mu \nu}+h^{\mu \nu}gμνημν+hμν. ... [The] initial flat 'background' space is no longer observable." In other words, this approach to Einstein's field equation can be summarized as "curvature without curvature" or-equally well-as "flat spacetime without flat spacetime"!

Box 17.2 (continued)

  1. Sakharov's view of gravitation as an elasticity of space that arises from particle physics.
    a. The resistance of a homogeneous isotropic solid to deformation is described by two elastic constants, Young's modulus and Poisson's ratio.
    b. The resistance of space to deformation is described by one elastic constant, the Newtonian constant of gravity. It makes its appearance in the action principle of Hilbert
I = 1 16 π G ( 4 ) R ( g ) 1 / 2 d 4 x (22) + ( L matter + L fields ) ( g ) 1 / 2 d 4 x = extremum I = 1 16 π G ( 4 ) R ( g ) 1 / 2 d 4 x (22) + L matter  + L fields  ( g ) 1 / 2 d 4 x =  extremum  {:[I=(1)/(16 pi G)int^((4))R(-g)^(1//2)d^(4)x],[(22)+int(L_("matter ")+L_("fields "))(-g)^(1//2)d^(4)x=" extremum "]:}\begin{align*} I= & \frac{1}{16 \pi G} \int{ }^{(4)} R(-g)^{1 / 2} d^{4} x \\ & +\int\left(L_{\text {matter }}+L_{\text {fields }}\right)(-g)^{1 / 2} d^{4} x=\text { extremum } \tag{22} \end{align*}I=116πG(4)R(g)1/2d4x(22)+(Lmatter +Lfields )(g)1/2d4x= extremum 
c. According to the historical records, it was first learned how many elastic constants it takes to describe a solid from microscopic molecular models of matter (Newton, Laplace, Navier, Cauchy, Poisson, Voigt, Kelvin, Born), not from macroscopic considerations of symmetry and invariance. Thus, count the energy stored up in molecular bonds that are deformed from natural length or natural angle or both. Arrive at an expression for the energy of deformation per unit volume of the elastic material of the form
(23) e = A ( Tr s ) 2 + B Tr ( s 2 ) (23) e = A ( Tr s ) 2 + B Tr s 2 {:(23)e=A(Tr s)^(2)+B Tr(s^(2)):}\begin{equation*} e=A(\operatorname{Tr} \boldsymbol{s})^{2}+B \operatorname{Tr}\left(\boldsymbol{s}^{2}\right) \tag{23} \end{equation*}(23)e=A(Trs)2+BTr(s2)
Here the strain tensor
(24) s m n = 1 2 ( ξ m x n + ξ n x m ) (24) s m n = 1 2 ξ m x n + ξ n x m {:(24)s_(mn)=(1)/(2)((delxi_(m))/(delx^(n))+(delxi_(n))/(delx^(m))):}\begin{equation*} s_{m n}=\frac{1}{2}\left(\frac{\partial \xi_{m}}{\partial x^{n}}+\frac{\partial \xi_{n}}{\partial x^{m}}\right) \tag{24} \end{equation*}(24)smn=12(ξmxn+ξnxm)
measures the strain produced in the elastic medium by motion of the typical point that was at the location x m x m x^(m)x^{m}xm to the location x m + ξ m ( x ) x m + ξ m ( x ) x^(m)+xi^(m)(x)x^{m}+\xi^{m}(x)xm+ξm(x). The constants A A AAA and B B BBB are derived out of microscopic physics. They fix the values of the two elastic constants of the macroscopic theory of elasticity.
d. Andrei Sakharov (1967) (the Andrei Sakharov) has proposed a similar microscopic foundation for gravitation or, as he calls it, the "metric elasticity of space." He identifies the action term of Einstein's geometrodynamics [the first term in (22)] "with the change in the action of quantum fluctuations of the vacuum [associated with the physics of particles and fields and brought about] when space is curved."
e. Sakharov notes that present-day quantum field theory "gets rid by a renormalization process" of an energy density in the vacuum that would formally be infinite if not removed by this renormalization. Thus, in the standard analysis of the degrees of freedom of the electromagnetic field in flat space, one counts the number of modes of vibration per unit volume in the range
of circular wave numbers from k k kkk to k + d k k + d k k+dkk+d kk+dk as ( 2 4 π / 8 π 3 ) k 2 d k 2 4 π / 8 π 3 k 2 d k (2*4pi//8pi^(3))k^(2)dk\left(2 \cdot 4 \pi / 8 \pi^{3}\right) k^{2} d k(24π/8π3)k2dk. Each mode of oscillation, even at the absolute zero of temperature, has an absolute irreducible minimum of "zero-point energy of oscillation," 1 2 h ν = 1 2 c k 1 2 h ν = 1 2 c k (1)/(2)h nu=(1)/(2)ℏck\frac{1}{2} h \nu=\frac{1}{2} \hbar c k12hν=12ck [the fluctuating electric field associated with which is among the most firmly established of all physical effects. It acts on the electron in the hydrogen atom in supplement to the electric field caused by the proton alone, and thereby produces most of the famous Lamb-Retherford shift in the energy levels of the hydrogen atom, as made especially clear by Welton (1948) and Dyson (1954)]. The totalized density of zero-point energy of the electromagnetic field per unit volume of spacetime (units: cm 4 cm 4 cm^(4)\mathrm{cm}^{4}cm4 ) formally diverges as
(25) ( / 2 π 2 ) 0 k 3 d k (25) / 2 π 2 0 k 3 d k {:(25)(ℏ//2pi^(2))int_(0)^(oo)k^(3)dk:}\begin{equation*} \left(\hbar / 2 \pi^{2}\right) \int_{0}^{\infty} k^{3} d k \tag{25} \end{equation*}(25)(/2π2)0k3dk
Equally formally this divergence is "removed" by "renormalization" [for more on renormalization see, for example, Hepp (1969)].
f. Similar divergences appear when one counts up formally the energy associated with other fields and with vacuum fluctuations in number of pairs of electrons, μ μ mu\muμ-mesons, and other particles in the limit of quantum energies large in comparison with the rest mass of any of these particles. Again these divergences in formal calculations are "removed by renormalization."
g. Removed by renormalization is a contribution not only to the energy density, and therefore to the stress-energy tensor, but also to the total Lagrange function E E E\mathcal{E}E of the variational principle for all these fields and particles,
(26) I = E d 4 x = extremum (26) I = E d 4 x =  extremum  {:(26)I=intEd^(4)x=" extremum ":}\begin{equation*} I=\int \mathcal{E} d^{4} x=\text { extremum } \tag{26} \end{equation*}(26)I=Ed4x= extremum 
h. Curving spacetime alters all these energies, Sakharov points out, extending an argument of Zel'dovich (1967). Therefore the process of "renormalization" or "subtraction" no longer gives zero. Instead, the contribution of zero-point energies to the Lagrangian, expanded as a power series in powers of the curvature, with numerical coefficients A , B , A , B , A,B,dotsA, B, \ldotsA,B, of the order of magnitude of unity, takes a form simplified by Ruzmaikina and Ruzmaikin (1969) to the following:
L ( R ) = A k 3 d k + B ( 4 ) R k d k + [ C ( ( 4 ) R ) 2 + D R α β R α β ] k 1 d k (27) + (higher-order terms) . L ( R ) = A k 3 d k + B ( 4 ) R k d k + C ( 4 ) R 2 + D R α β R α β k 1 d k (27) +  (higher-order terms)  . {:[L(R)=Aℏintk^(3)dk+Bℏ^((4))R int kdk],[+ℏ[C(^((4))R)^(2)+DR^(alpha beta)R_(alpha beta)]intk^(-1)dk],[(27)+" (higher-order terms) ".]:}\begin{align*} \mathcal{L}(R)= & A \hbar \int k^{3} d k+B \hbar^{(4)} R \int k d k \\ & +\hbar\left[C\left({ }^{(4)} R\right)^{2}+D R^{\alpha \beta} R_{\alpha \beta}\right] \int k^{-1} d k \\ & + \text { (higher-order terms) } . \tag{27} \end{align*}L(R)=Ak3dk+B(4)Rkdk+[C((4)R)2+DRαβRαβ]k1dk(27)+ (higher-order terms) .
[For the alteration in the number of standing waves per unit frequency in a curved manifold, see also Berger (1966), Sakharov (1967), Hill in De Witt (1967c), Polievktov-Nikoladze (1969), and Berger, Gauduchon, and Mazet (1971).]
i. Renormalization physics argues that the first term in (27) is to be dropped. The second term, Sakharov notes, is identical in form to the Hilbert action

Box 17.2 (continued)

principle, equation (3) above, with the exception that there the constant that multiplies the Riemann scalar curvature invariant is c 3 / 16 π G c 3 / 16 π G -c^(3)//16 pi G-c^{3} / 16 \pi Gc3/16πG (in conventional units), whereas here it is B j k d k B j k d k BℏjkdkB \hbar j k d kBjkdk (in the same conventional units). The higher order terms in (27) lead to what Sakharov calls "corrections . . . to Einstein's equations."
j. Overlooking these corrections, one evidently obtains the action principle of Einstein's theory when one insists on the equality
(28) G = ( Newtonian constant of gravity ) = c 3 16 π B j k d k (28) G = (  Newtonian   constant of gravity  ) = c 3 16 π B j k d k {:(28)G=((" Newtonian ")/(" constant of gravity "))=(c^(3))/(16 pi Bℏjkdk):}\begin{equation*} G=\binom{\text { Newtonian }}{\text { constant of gravity }}=\frac{c^{3}}{16 \pi B \hbar j k d k} \tag{28} \end{equation*}(28)G=( Newtonian  constant of gravity )=c316πBjkdk
With B B BBB a dimensionless numerical factor of the order of unity, it follows, Sakharov argues, that the effective upper limit or "cutoff" in the formally divergent integral in (28) is to be taken to be of the order of magnitude of the reciprocal Planck length [see equation (7)],
(29) k cut off ( c 3 / G ) 1 / 2 = 1 / L = 1 / 1.6 × 10 33 cm (29) k cut off  c 3 / G 1 / 2 = 1 / L = 1 / 1.6 × 10 33 cm {:(29)k_("cut off ")∼(c^(3)//ℏG)^(1//2)=1//L^(**)=1//1.6 xx10^(-33)cm:}\begin{equation*} k_{\text {cut off }} \sim\left(c^{3} / \hbar G\right)^{1 / 2}=1 / L^{*}=1 / 1.6 \times 10^{-33} \mathrm{~cm} \tag{29} \end{equation*}(29)kcut off (c3/G)1/2=1/L=1/1.6×1033 cm
In effect Sakharov is saying (1) that field physics suffers a sea change into something new and strange for wavelengths less than the Planck length, and for quantum energies of the order of c k cutoff 10 28 eV c k cutoff  10 28 eV ℏck_("cutoff ")∼10^(28)eV\hbar c k_{\text {cutoff }} \sim 10^{28} \mathrm{eV}ckcutoff 1028eV or 10 5 g 10 5 g 10^(-5)g10^{-5} \mathrm{~g}105 g or more; (2) that in consequence the integral k d k k d k int kdk\int k d kkdk is cut off; and (3) that the value of this cutoff, arising purely out of the physics of fields and particles, governs the value of the Newtonian constant of gravity, G G GGG.
k. In this sense, Sakharov's analysis suggests that gravitation is to particle physics as elasticity is to chemical physics: merely a statistical measure of residual energies. In the one case, molecular bindings depend on departures of mole-cule-molecule bond lengths from standard values. In the other case, particle energies are affected by curvatures of the geometry.
  1. Elasticity, which looks simple, gets its explanation from molecular bindings, which are complicated; but molecular bindings, which are complicated, receive their explanation in terms of Schrödinger's wave equation and Coulomb's law of force between charged point-masses, which are even simpler than elasticity.
    m . Einstein's geometrodynamics, which looks simple, is interpreted by Sakharov as a correction term in particle physics, which is complicated. Is particle physics, which is complicated, destined some day in its turn to unravel into something simple-something far deeper and far simpler than geometry ("pregeometry"; Chapter 44)?

§17.6. '"NO PRIOR GEOMETRY': A FEATURE DISTINGUISHING EINSTEIN'S THEORY FROM OTHER THEORIES OF GRAVITY

Whereas Einstein's theory of gravity is exceedingly compelling, one can readily construct less compelling and less elegant alternative theories. The physics literature is replete with examples [see Ni (1972), and Thorne, Ni , and Will (1971) for reviews]. However, when placed among its competitors, Einstein's theory stands out sharp and clear: it agrees with experiment; most of its competitors do not (Chapters 38-40). It describes gravity entirely in terms of geometry; most of its competitors do not. It is free of any "prior geometry"; most of its competitors are not.
Set aside, until Chapter 38, the issue of agreement with experiment. Einstein's theory remains unique. Every other theory either introduces auxiliary gravitational fields [e.g., the scalar field of Brans and Dicke (1961)], or involves "prior geometry," or both. Thus, every other theory is more complicated conceptually than Einstein's theory. Every other theory contains elements of complexity for which there is no experimental motivation.
The concept of "prior geometry" requires elucidation, not least because the rejection of prior geometry played a key role in the reasoning that originally led Einstein to his geometrodynamic equation G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT. By "prior geometry" one means any aspect of the geometry of spacetime that is fixed immutably, i.e., that cannot be changed by changing the distribution of gravitating sources. Thus, prior geometry is not generated by or affected by matter; it is not dynamic. Example: Nordstrøm (1913) formulated a theory in which the physical metric of spacetime g g g\boldsymbol{g}g (the metric that enters into the equivalence principle) is generated by a "background" flat-spacetime metric η η eta\boldsymbol{\eta}η, and by a scalar gravitational field ϕ ϕ phi\phiϕ :
(17.23b) η α β ϕ , α β = 4 π ϕ η α β T α β ( generation of ϕ by stress-energy ) , g α β = ϕ 2 η α β ( construction of g from ϕ and η ) . (17.23b) η α β ϕ , α β = 4 π ϕ η α β T α β (  generation of  ϕ  by   stress-energy  ) , g α β = ϕ 2 η α β (  construction of  g  from  ϕ  and  η ) . {:(17.23b){:[eta^(alpha beta)phi_(,alpha beta)=-4pi phieta^(alpha beta)T_(alpha beta),((" generation of "phi" by ")/(" stress-energy "))","],[g_(alpha beta)=phi^(2)eta_(alpha beta),((" construction of "g)/(" from "phi" and "eta)).]:}:}\begin{array}{cc} \eta^{\alpha \beta} \phi_{, \alpha \beta}=-4 \pi \phi \eta^{\alpha \beta} T_{\alpha \beta} & \binom{\text { generation of } \phi \text { by }}{\text { stress-energy }}, \\ g_{\alpha \beta}=\phi^{2} \eta_{\alpha \beta} & \binom{\text { construction of } \boldsymbol{g}}{\text { from } \phi \text { and } \boldsymbol{\eta}} . \tag{17.23b} \end{array}(17.23b)ηαβϕ,αβ=4πϕηαβTαβ( generation of ϕ by  stress-energy ),gαβ=ϕ2ηαβ( construction of g from ϕ and η).
In this theory, the physical metric g g g\boldsymbol{g}g (governor of rods and clocks and of test-particle motion) has but one changeable degree of freedom-the freedom in ϕ ϕ phi\phiϕ. The rest of g g g\boldsymbol{g}g is fixed by the flat spacetime metric ("prior geometry") η η eta\boldsymbol{\eta}η. One does not remove the prior geometry by rewriting Nordstrom's equations (17.23) in a form

devoid of reference to η η eta\boldsymbol{\eta}η and ϕ ϕ phi\phiϕ [Einstein and Fokker (1914); exercise 17.8]. Mass can still influence only one degree of freedom in the spacetime geometry. The other degrees of freedom are fixed à priori-they are prior geometry. And this prior geometry can perfectly well (in principle) be detected by physical experiments that make no reference to any equations (Box 17.3).
Einstein's theory compared with other theories of gravity
All other theories introduce auxiliary gravitational fields or prior geometry
"Prior geometry" defined
Nordstrom's theory as an illustration of prior geometry

Box 17.3 AN EXPERIMENT TO DETECT OR EXCLUDE CERTAIN TYPES OF PRIOR GEOMETRY

(Based on December 1970 discussions between Alfred Schild and Charles W. Misner)
Choose a momentarily static universe populated with a large supply of suitable pulsars. The pulsars should be absolutely regular, periodically emitting characteristic pulses of both gravitational and electromagnetic waves.
Two fleets of spaceships containing receivers are sent out "on station" to collect the experimental data. Admiral Weber's fleet carries gravitationalwave receivers; Admiral Hertz's fleet, electromagnetic receivers. The captain of each spaceship holds himself "on station" by monitoring three suitably chosen pulsars (of identical frequency) and maneuvering so that their pulses always arrive in coincidence. The experimental data he collects consist of the pulses received from all other pulsars, which he is not using for station keeping, each registered as coincident with or interlaced among the reference (stationary) pulses. [For display purposes, the pattern produced by any single pulsar can be converted to acoustic form. The reference pulses can be played acoustically (by the dataprocessing computer) on one drum at a fixed rate, and the pulses from other pulsars can be played on a second drum. A pattern of rythmic beats will result.]
When the data fleet is checked out and tuned up, each captain reports stationary patterns. Now the experiment begins. One or more massive stars are towed in among the fleet. The fleet reacts to stay on station, and reports changes in the data patterns. The spaceships on the outside edges of the fleet verify that no detectable changes occur at their stations; so the incident radiation from the distant pulsars can be regarded as unaffected by the newly placed stars. Data stations nearer the movable stars report the interesting data.
What are the results?
In a universe governed by the laws of special relativity (spacetime always flat), no patterns change. (Weber's fleet was unable to get checked
out in the first place, as no gravitational waves were ever detected from the pulsars). Neither stars, nor anything else, can produce gravitational fields. All aspects of the spacetime geometry are fixed a a aaa priori (complete prior geometry!). There is no gravity; and no light deflection takes place to make Hertz's captains adjust their positions.
In a universe governed by Nordstrom's theory of gravity (see text) both fleets get checked outi.e., both see waves. But neither fleet sees any changes in the rhythmic pattern of beats. The stars being towed about have no influence on either gravitational waves or electromagnetic waves. The prior geometry ( n n n\boldsymbol{n}n ) present in the theory precludes any light deflection or any gravitational-wave deflection.
In a universe governed by Whitehead's (1922) theory of gravity [see Will (1971b) and references cited therein], radio waves propagate along geodesics of the "physical metric" g g g\boldsymbol{g}g, and get deflected by the gravitational fields of the stars. But gravitational waves propagate along geodesics of a flat background metric η η eta\boldsymbol{\eta}η, and are thus unaffected by the stars. Consequently, Hertz's captains must maneuver to keep on station; and they hear a changing beat pattern between the reference pulsars and the other pulsars. But Weber's fleet remains on station and records no changes in the beat pattern. The prior geometry ( η ) ( η ) (eta)(\boldsymbol{\eta})(η) shows itself clearly in the experimental result.
In a universe governed by Einstein's theory, both fleets see effects (no sign of prior geometry because Einstein's theory has no prior geometry). Moreover, if the fleets were originally paired, one Weber ship and one Hertz at each station, they remain paired. No differences exist between the propagation of high-frequency light waves and high-frequency gravitational waves. Both propagate along geodesics of g g g\boldsymbol{g}g.
Mathematics was not sufficiently refined in 1917 to cleave apart the demands for "no prior geometry" and for a "geometric, coordinate-independent formulation of physics." Einstein described both demands by a single phrase, "general covariance." The "no-prior-geometry" demand actually fathered general relativity, but by doing so anonymously, disguised as "general covariance," it also fathered half a century of confusion. [See, e.g., Kretschmann (1917).]
A systematic treatment of the distinction between prior geometry ("absolute objects") and dynamic fields ("dynamic objects") is a notable feature of Anderson's (1967) relativity text.
"No prior geometry" as a part of Einstein's principle of "general covariance"

Exercise 17.8. EINSTEIN-FOKKER REDUCES TO NORDSTRØM

The vanishing of the Weyl tensor [equation (13.50)] for a spacetime metric g g g\boldsymbol{g}g guarantees that the metric is conformally flat-i.e., that there exists a scalar field ϕ ϕ phi\phiϕ such that g = ϕ 2 η g = ϕ 2 η g=phi^(2)eta\boldsymbol{g}=\phi^{2} \boldsymbol{\eta}g=ϕ2η, where η η eta\boldsymbol{\eta}η is a flat-spacetime metric. [See, e.g., Schouten (1954) for proof.] Thus, the EinsteinFokker equation (17.24), C α β μ ν = 0 C α β μ ν = 0 C^(alpha beta)_(mu nu)=0C^{\alpha \beta}{ }_{\mu \nu}=0Cαβμν=0, is equivalent to the Nordstrøm equation (17.23b). With this fact in hand, show that the Einstein-Fokker field equation R = 24 π T R = 24 π T R=24 pi TR=24 \pi TR=24πT reduces to the Nordstrøm field equation (17.23a).

EXERCISE

§17.7. A TASTE OF THE HISTORY OF EINSTEIN'S EQUATION

Nothing shows better what an idea is and means today than the battles and changes it has undergone on its way to its present form. A complete history of general relativity would demand a book. Here let a few key quotes from a few of the great papers give a little taste of what a proper history might encompass.
Einstein (1908): "We . . . will therefore in the following assume the complete physical equivalence of a gravitational field and of a corresponding acceleration of the reference system. . . . the clock at a point P P PPP for an observer anywhere in space runs ( 1 + Φ / c 2 ) 1 + Φ / c 2 (1+Phi//c^(2))\left(1+\Phi / c^{2}\right)(1+Φ/c2) times faster than the clock at the coordinate origin. . . . it follows that light rays are curved by the gravitational field. . . . an amount of energy E E EEE has a mass E / c 2 E / c 2 E//c^(2)E / c^{2}E/c2."
Einstein and Grossmann (1913): "The theory described here originates from the conviction that the proportionality between the inertial and the gravitational mass of a body is an exact law of nature that must be expressed as a foundation principle of theoretical physics. . . . An observer enclosed in an elevator has no way to decide whether the elevator is at rest in a static gravitational field or whether the elevator is located in gravitation-free space in an accelerated motion that is maintained by forces acting on the elevator (equivalence hypothesis). ... In the decay of radium, for example, that decrease [of mass] amounts to 1 / 10 , 000 1 / 10 , 000 1//10,0001 / 10,0001/10,000 of the total mass. If those changes in inertial mass did not correspond to changes in gravitational mass, then deviations of inertial from gravitational masses would arise that are far larger than the Eötvös experiments allow. It must therefore be considered as very probable that the identity of gravitational and inertial mass is exact.
"The sought for generalization will surely be of the form
Γ μ ν = κ T μ ν , Γ μ ν = κ T μ ν , Gamma_(mu nu)=kappaT_(mu nu),\Gamma_{\mu \nu}=\kappa T_{\mu \nu},Γμν=κTμν,
where κ κ kappa\kappaκ is a constant and Γ μ ν Γ μ ν Gamma_(mu nu)\Gamma_{\mu \nu}Γμν is a contravariant tensor of the second rank that arises out of the fundamental tensor g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν through differential operations. . . it proved impossible to find a differential expression for Γ μ ν Γ μ ν Gamma_(mu nu)\Gamma_{\mu \nu}Γμν that is a generalization of [Poisson's] Δ ϕ Δ ϕ Delta phi\Delta \phiΔϕ, and that is a tensor with respect to arbitrary transformations. . . . It seems most natural to demand that the system of equations should be covariant against arbitrary transformations. That stands in conflict with the result that the equations of the gravitational field do not possess this property."
Einstein and Grossman (1914): "In a 1913 treatment . . . we could not show general covariance for these gravitational equations. [Origin of their difficulty: part of the two-index curvature tensor was put on the left, to constitute the second-order part of the field equation, and part was put on the right with T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν and was called gravitational stress-energy. It was asked that lefthand and righthand sides transform as tensors, which they cannot do under general coordinate transformations.]
Einstein (1915a): "In recent years I had been trying to found a general theory of relativity on the assumption of the relativity even of nonuniform motions. I believed in fact that I had found the only law of gravitation that corresponds to a reasonably formulated postulate of general relativity, and I sought to establish the necessity of exactly this solution in a paper that appeared last year in these proceedings.
"A renewed analysis showed me that that necessity absolutely was not shown in the approach adopted there; that it nevertheless appeared to be shown rested on an error.
"For these reasons, I lost all confidence in the field equations I had set up, and I sought for an approach that would limit the possibilities in a natural way. In this way I was led back to the demand for the general covariance of the field equations, from which I had departed three years ago, while working with my friend Grossmann, only with a heavy heart. In fact we had already at that time come quite near to the solution of the problem that is given in what follows.
"According to what has been said, it is natural to postulate the field equations of gravitation in the form
R μ v = κ T μ v R μ v = κ T μ v R_(mu v)=-kappaT_(mu v)R_{\mu v}=-\kappa T_{\mu v}Rμv=κTμv
since we already know that these equations are covariant with respect to arbitrary transformations of determinant 1 . In fact, these equations satisfy all conditions that we have to impose on them. [Here R μ ν R μ ν R_(mu nu)R_{\mu \nu}Rμν is a piece of the Ricci tensor that Einstein regarded as covariant.] . .
"Equations (22a) give in the first approximation
2 g α β x α x β = 0 2 g α β x α x β = 0 (del^(2)g^(alpha beta))/(delx^(alpha)delx^(beta))=0\frac{\partial^{2} g^{\alpha \beta}}{\partial x^{\alpha} \partial x^{\beta}}=02gαβxαxβ=0
By this [condition] the coordinate system is still not determined, in the sense that for this determination four equations are necessary." (Session of Nov. 4, 1915, published Nov. 11.)
Einstein (1915b): "In a recently published investigation, I have shown how a theory of the gravitational field can be founded on Riemann's covariant theory of many-di-
mensional manifolds. Here it will now be proved that, by introducing a surely bold additional hypothesis on the structure of matter, a still tighter logical structure of the theory can be achieved. . . . it may very well be possible that in the matter to which the given expression refers, gravitational fields play an essential part. Then T μ μ T μ μ T^(mu)_(mu)T^{\mu}{ }_{\mu}Tμμ can appear to be positive for the entire structure, although in reality only T μ μ + t μ μ T μ μ + t μ μ T^(mu)_(mu)+t^(mu)_(mu)T^{\mu}{ }_{\mu}+t^{\mu}{ }_{\mu}Tμμ+tμμ is positive, and T μ μ T μ μ T^(mu)_(mu)T^{\mu}{ }_{\mu}Tμμ vanishes everywhere. We assume in the following that in fact the condition T μ μ = 0 T μ μ = 0 T^(mu)_(mu)=0T^{\mu}{ }_{\mu}=0Tμμ=0 is fulfilled [quite] generally.
"Whoever does not from the beginning reject the hypothesis that molecular [smallscale] gravitational fields constitute an essential part of matter will see in the following a strong support for this point of view.
"Our hypothesis makes it possible . . . to give the field equations of gravitation in a generally covariant form . .
G μ ν = κ T μ ν G μ ν = κ T μ ν G_(mu nu)=-kappaT_(mu nu)G_{\mu \nu}=-\kappa T_{\mu \nu}Gμν=κTμν
[where G μ ν G μ ν G_(mu nu)G_{\mu \nu}Gμν is the Ricci tensor]." (Session of Nov. 11, 1915; published Nov. 18.)
Einstein (1915c): "I have shown that no objection of principle stands in the way of this hypothesis [the field equations], by which space and time are deprived of the last trace of objective reality. In the present work I find an important confirmation of this most radical theory of relativity: it turns out that it explains qualitatively and quantitatively the secular precession of the orbit of Mercury in the direction of the orbital motion, as discovered by Leverrier, which amounts to about 45 45 45^('')45^{\prime \prime}45 per century, without calling on any special hypothesis whatsoever."
Einstein (1915d; session of Nov. 25, 1915; published Dec. 2): "More recently I have found that one can proceed without hypotheses about the energy tensor of matter when one introduces the energy tensor of matter in a somewhat different way than was done in my two earlier communications. The field equations for the motion of the perihelion of Mercury are undisturbed by this modification. .
"Let us put
G i m = κ ( T i m 1 2 g i m T ) G i m = κ T i m 1 2 g i m T G_(im)=-kappa(T_(im)-(1)/(2)g_(im)T)G_{i m}=-\kappa\left(T_{i m}-\frac{1}{2} g_{i m} T\right)Gim=κ(Tim12gimT)
[where G i m G i m G_(im)G_{i m}Gim is the Ricci tensor]." . . .
. . . these equations, in contrast to (9), contain no new condition, so that no other assumption has to be made about the energy tensor of matter than obedience to the energy-momentum [conservation] laws.
"With this step, general relativity is finally completed as a logical structure. The postulate of relativity in its most general formulation, which makes the spacetime coordinates into physically meaningless parameters, leads compellingly to a completely determinate theory of gravitation that explains the perihelion motion of Mercury. In contrast, the general-relativity postulate is able to open up to us nothing about the nature of the other processes of nature that special relativity has not already taught. The opinion on this point that I recently expressed in these proceedings was erroneous. Every physical theory compatible with special relativity can be aligned into the system of general relativity by means of the absolute differential calculus, without [general relativity] supplying any criterion for the acceptability of that theory."
Hilbert (1915): "Axiom I [notation changed to conform to usage in this book]. The
law of physical events is determined through a world function [Mie's terminology; better known today as "Lagrangian"] L L LLL, that contains the following arguments:
g μ ν , g μ ν x α , 2 g μ ν x α x β , A σ , A σ x τ g μ ν , g μ ν x α , 2 g μ ν x α x β , A σ , A σ x τ {:[g_(mu nu)","(delg_(mu nu))/(delx^(alpha))","(del^(2)g_(mu nu))/(delx^(alpha)delx^(beta))","],[A_(sigma)","(delA_(sigma))/(delx^(tau))]:}\begin{gathered} g_{\mu \nu}, \frac{\partial g_{\mu \nu}}{\partial x^{\alpha}}, \frac{\partial^{2} g_{\mu \nu}}{\partial x^{\alpha} \partial x^{\beta}}, \\ A_{\sigma}, \frac{\partial A_{\sigma}}{\partial x^{\tau}} \end{gathered}gμν,gμνxα,2gμνxαxβ,Aσ,Aσxτ
and specifically the variation of the integral
L ( g ) 1 / 2 d 4 x L ( g ) 1 / 2 d 4 x int L(-g)^(1//2)d^(4)x\int L(-g)^{1 / 2} d^{4} xL(g)1/2d4x
must vanish for [changes in] every one of the 14 potentials g σ ν , A σ g σ ν , A σ g_(sigma nu),A_(sigma)dotsg_{\sigma \nu}, A_{\sigma} \ldotsgσν,Aσ
"Axiom II (axiom of general invariance). The world function L L LLL is invariant with respect to arbitrary transformations of the world parameters [coordinates] x α x α x^(alpha)x^{\alpha}xα. . .
"For the world function L L LLL, still further axioms are needed to make its choice unambiguous. If the gravitation equations are to contain only second derivatives of the potentials g σ ν g σ ν g^(sigma nu)g^{\sigma \nu}gσν, then L L LLL must have the form
L = R + L elec L = R + L elec  L=R+L_("elec ")L=R+L_{\text {elec }}L=R+Lelec 
where R R RRR is the invariant built from the Riemann tensor (curvature of the four-dimensional manifold." (Session of Nov. 20, 1915.)
Einstein (1916c): "Recently H. A. Lorentz and D. Hilbert have succeeded in giving general relativity an especially transparent form in deriving its equations from a single variation principle. This will be done also in the following treatment. There it is my aim to present the basic relations as transparently as possible and in a way as general as general relativity allows."
Einstein (1916b): "From this it follows, first of all, that gravitational fields spread out with the speed of light. . . . [plane] waves transport energy. . . . One thus gets . . . the radiation of the system per unit time. . . .
G 24 π α , β ( 3 J α β t 3 ) 2 . " G 24 π α , β 3 J α β t 3 2 . " (G)/(24 pi)sum_(alpha,beta)((del^(3)J_(alpha beta))/(delt^(3)))^(2)."\frac{G}{24 \pi} \sum_{\alpha, \beta}\left(\frac{\partial^{3} J_{\alpha \beta}}{\partial t^{3}}\right)^{2} . "G24πα,β(3Jαβt3)2."
Hilbert (1917): "As for the principle of causality, the physical quantities and their time-rates of change may be known at the present time in any given coordinate system; a prediction will then have a physical meaning only when it is invariant with respect to all those transformations for which exactly those coordinates used for the present time remain unchanged. I declare that predictions of this kind for the future are all uniquely determined; that is, that the causality principle holds in this formulation:
"From the knowledge of the 14 physical potentials g μ ν , A σ g μ ν , A σ g_(mu nu),A_(sigma)g_{\mu \nu}, A_{\sigma}gμν,Aσ, in the present, all predictions about the same quantities in the future follow necessarily and uniquely insofar as they have physical meaning."

стинтег 18

WEAK GRAVITATIONAL FIELDS

The way that can be walked on is not the perfect way. The word that can be said is not the perfect word.

§18.1. THE LINEARIZED THEORY OF GRAVITY

Because of the geometric language and abbreviations used in writing them, Einstein's field equations, G μ ν = 8 π T μ ν G μ ν = 8 π T μ ν G_(mu nu)=8piT_(mu nu)G_{\mu \nu}=8 \pi T_{\mu \nu}Gμν=8πTμν, hardly seem to be differential equations at all, much less ones with many familiar properties. The best way to see that they are is to apply them to weak-field situations
(18.1) g μ ν = η μ ν + h μ ν , | h μ ν | 1 , (18.1) g μ ν = η μ ν + h μ ν , h μ ν 1 , {:(18.1)g_(mu nu)=eta_(mu nu)+h_(mu nu)","quad|h_(mu nu)|≪1",":}\begin{equation*} g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}, \quad\left|h_{\mu \nu}\right| \ll 1, \tag{18.1} \end{equation*}(18.1)gμν=ημν+hμν,|hμν|1,
e.g., to the solar system, where | h μ ν | | Φ | M / R 10 6 h μ ν | Φ | M / R 10 6 |h_(mu nu)|∼|Phi|≲M_(o.)//R_(o.)∼10^(-6)\left|h_{\mu \nu}\right| \sim|\Phi| \lesssim M_{\odot} / R_{\odot} \sim 10^{-6}|hμν||Φ|M/R106; or to a weak gravitational wave propagating through interstellar space.
In a weak-field situation, one can expand the field equations in powers of h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν, using a coordinate frame where (18.1) holds; and without much loss of accuracy, one can keep only linear terms. The resulting formalism is often called "the linearized theory of gravity," because it is an important theory in its own right. In fact, it is precisely this "linearized theory" that one obtains when one asks for the classical field corresponding to quantum-mechanical particles of (1) zero rest mass and (2) spin two in (3) flat spacetime [see Fierz and Pauli (1939)]. Track-2 readers have already explored linearized theory somewhat in §7.1, exercise 7.3, and Box 7.1. There it went under the alternative name, "tensor-field theory of gravity in flat spacetime."
"Linearized theory of gravity":
(1) as weak-field limit of general relativity
(2) as standard
"field-theory" description of gravity in "flat spacetime"
(3) as a foundation for "deriving" general relativity
Details of linearized theory:
(1) connection coefficients
(2) "gravitational potentials" h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν
Just as one can "descend" from general relativity to linearized theory by linearizing about flat spacetime (see below), so one can "bootstrap" one's way back up from linearized theory to general relativity by imposing consistency between the linearized field equations and the equations of motion. or, equivalently, by asking about: (1) the stress-energy carried by the linearized gravitational field h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν; (2) the influence of this stress-energy acting as a source for corrections h ( 1 ) μ ν h ( 1 ) μ ν h^((1))_(mu nu)h^{(1)}{ }_{\mu \nu}h(1)μν to the field; (3) the stress-energy carried by the corrections h ( 1 ) μ ν h ( 1 ) μ ν h^((1))_(mu nu)h^{(1)}{ }_{\mu \nu}h(1)μν; (4) the influence of this stress-energy acting as a source for corrections h ( 2 ) μ ν h ( 2 ) μ ν h^((2))_(mu nu)h^{(2)}{ }_{\mu \nu}h(2)μν to the corrections h ( 1 ) μ ν ; ( 5 ) h ( 1 ) μ ν ; ( 5 ) h^((1))_(mu nu);(5)h^{(1)}{ }_{\mu \nu} ;(5)h(1)μν;(5) the stress-energy carried by the corrections to the corrections; and so on. This alternative way to derive general relativity has been developed and explored by Gupta (1954, 1957, 1962), Kraichnan (1955), Thirring (1961), Feynman (1963a), Weinberg (1965), and Deser (1970). But because the outlook is far from geometric (see Box 18.1), the details of the derivation are not presented here. (But see part 5 of Box 17.2.)
Here attention focuses on deriving linearized theory from general relativity. Adopt the form (18.1) for the metric components. The resulting connection coefficients [equations ( 8.24 b ) ( 8.24 b ) (8.24b)(8.24 \mathrm{~b})(8.24 b) ], when linearized in the metric perturbation h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν, read
Γ α β μ = 1 2 η μ ν ( h α v , β + h β v , α h α β , v ) (18.2) 1 2 ( h α μ , β + h β , α μ h α β μ ) Γ α β μ = 1 2 η μ ν h α v , β + h β v , α h α β , v (18.2) 1 2 h α μ , β + h β , α μ h α β μ {:[Gamma_(alpha beta)^(mu)=(1)/(2)eta^(mu nu)(h_(alpha v,beta)+h_(beta v,alpha)-h_(alpha beta,v))],[(18.2)-=(1)/(2)(h_(alpha)^(mu),beta:}],[{:+h_(beta^('),alpha)^(mu)-h_(alphabeta^('))^(mu))]:}\begin{align*} \Gamma_{\alpha \beta}^{\mu} & =\frac{1}{2} \eta^{\mu \nu}\left(h_{\alpha v, \beta}+h_{\beta v, \alpha}-h_{\alpha \beta, v}\right) \\ & \equiv \frac{1}{2}\left(h_{\alpha}{ }^{\mu}, \beta\right. \tag{18.2}\\ & \left.+h_{\beta^{\prime}, \alpha}^{\mu}-h_{\alpha \beta^{\prime}}{ }^{\mu}\right) \end{align*}Γαβμ=12ημν(hαv,β+hβv,αhαβ,v)(18.2)12(hαμ,β+hβ,αμhαβμ)
The second line here introduces the convention, used routinely whenever one expands in powers of h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν, that indices of h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν are raised and lowered using η μ ν η μ ν eta^(mu nu)\eta^{\mu \nu}ημν and η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν, not g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν and g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν. A similar linearization of the Ricci tensor [equation (8.47)] yields
(18.3) R μ ν = Γ α μ ν , α Γ α μ α , ν = 1 2 ( h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν ) , (18.3) R μ ν = Γ α μ ν , α Γ α μ α , ν = 1 2 h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν , {:[(18.3)R_(mu nu)=Gamma^(alpha)_(mu nu,alpha)-Gamma^(alpha)_(mu alpha,nu)],[=(1)/(2)(h_(mu)^(alpha)_(,nu alpha)+(h_(nu)^(alpha),mu alpha)-h_(mu nu,alpha)^(alpha)-h_(,mu nu))","]:}\begin{align*} R_{\mu \nu} & =\Gamma^{\alpha}{ }_{\mu \nu, \alpha}-\Gamma^{\alpha}{ }_{\mu \alpha, \nu} \tag{18.3}\\ & =\frac{1}{2}\left(h_{\mu}^{\alpha}{ }_{, \nu \alpha}+{h_{\nu}{ }^{\alpha}, \mu \alpha}-h_{\mu \nu, \alpha}^{\alpha}-h_{, \mu \nu}\right), \end{align*}(18.3)Rμν=Γαμν,αΓαμα,ν=12(hμα,να+hνα,μαhμν,ααh,μν),
where
(18.4) h h α α = η α β h α β . (18.4) h h α α = η α β h α β . {:(18.4)h-=h_(alpha)^(alpha)=eta^(alpha beta)h_(alpha beta).:}\begin{equation*} h \equiv h_{\alpha}^{\alpha}=\eta^{\alpha \beta} h_{\alpha \beta} . \tag{18.4} \end{equation*}(18.4)hhαα=ηαβhαβ.
After a further contraction to form R g μ ν R μ ν η μ ν R μ ν R g μ ν R μ ν η μ ν R μ ν R-=g^(mu nu)R_(mu nu)~~eta^(mu nu)R_(mu nu)R \equiv g^{\mu \nu} R_{\mu \nu} \approx \eta^{\mu \nu} R_{\mu \nu}RgμνRμνημνRμν, one finds that the Einstein equations, 2 G μ ν = 16 π T μ ν 2 G μ ν = 16 π T μ ν 2G_(mu nu)=16 piT_(mu nu)2 G_{\mu \nu}=16 \pi T_{\mu \nu}2Gμν=16πTμν, read
(18.5) h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν η μ ν ( h α β , α β h , β β ) = 16 π T μ ν . (18.5) h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν η μ ν h α β , α β h , β β = 16 π T μ ν . {:[(18.5)h_(mu alpha,nu)^(alpha)+h_(nu alpha,mu)^(alpha)-h_(mu nu,alpha)^(alpha)-h_(,mu nu)],[-eta_(mu nu)(h_(alpha beta)^(,alpha beta)-h_(,beta)^(beta))=16 piT_(mu nu).]:}\begin{align*} h_{\mu \alpha, \nu}^{\alpha} & +h_{\nu \alpha, \mu}{ }^{\alpha}-h_{\mu \nu, \alpha}^{\alpha}-h_{, \mu \nu} \tag{18.5}\\ & -\eta_{\mu \nu}\left(h_{\alpha \beta}^{, \alpha \beta}-h_{, \beta}^{\beta}\right)=16 \pi T_{\mu \nu} . \end{align*}(18.5)hμα,να+hνα,μαhμν,ααh,μνημν(hαβ,αβh,ββ)=16πTμν.
The number of terms has increased in passing from R μ ν ( 18.3 ) R μ ν ( 18.3 ) R_(mu nu)(18.3)R_{\mu \nu}(18.3)Rμν(18.3) to G μ ν = R μ ν 1 2 g μ ν R G μ ν = R μ ν 1 2 g μ ν R G_(mu nu)=R_(mu nu)-(1)/(2)g_(mu nu)RG_{\mu \nu}=R_{\mu \nu}-\frac{1}{2} g_{\mu \nu} RGμν=Rμν12gμνR (18.5), but this annoyance can be counteracted by defining
(18.6) h ¯ μ ν h μ ν 1 2 η μ ν h (18.6) h ¯ μ ν h μ ν 1 2 η μ ν h {:(18.6) bar(h)_(mu nu)-=h_(mu nu)-(1)/(2)eta_(mu nu)h:}\begin{equation*} \bar{h}_{\mu \nu} \equiv h_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} h \tag{18.6} \end{equation*}(18.6)h¯μνhμν12ημνh
and using a bar to imply a corresponding operation on any other symmetric tensor.
Box 18.1 DERIVATIONS OF GENERAL RELATIVITY FROM GEOMETRIC VIEWPOINT AND FROM SPIN-TWO VIEWPOINT, COMPARED AND CONTRASTED
Einstein derivation Spin-2 derivation
Nature of primordial spacetime geometry? Not primordial; geometry is a dynamic participant in physics "God-given" flat Lorentz spacetime manifold
Topology (multiple connectedness) of spacetime? Laws of physics are local; they do not specify the topology Simply connected Euclidean topology
Vision of physics? Dynamic geometry is the "master field" of physics This field, that field, and the other field all execute their dynamics in a flatspacetime manifold
Starting points for this derivation of general relativity
1. Equivalence principle (world lines of photons and test particles are geodesics of the spacetime geometry)
2. That tensorial conserved quantity which is derived from the curvature (Cartan's moment of rotation) is to be identified with the tensor of stress-momentum-energy (see Chapter 15).
1. Equivalence principle (world lines of photons and test particles are geodesics of the spacetime geometry) 2. That tensorial conserved quantity which is derived from the curvature (Cartan's moment of rotation) is to be identified with the tensor of stress-momentum-energy (see Chapter 15).| 1. Equivalence principle (world lines of photons and test particles are geodesics of the spacetime geometry) | | :--- | | 2. That tensorial conserved quantity which is derived from the curvature (Cartan's moment of rotation) is to be identified with the tensor of stress-momentum-energy (see Chapter 15). |
1. Begin with field of spin two and zero rest mass in flat spacetime.
2. Stress-energy tensor built from this field serves as a source for this field.
1. Begin with field of spin two and zero rest mass in flat spacetime. 2. Stress-energy tensor built from this field serves as a source for this field.| 1. Begin with field of spin two and zero rest mass in flat spacetime. | | :--- | | 2. Stress-energy tensor built from this field serves as a source for this field. |
Resulting equations Einstein's field equations Einstein's field equations
Resulting assessment of the spacetime geometry from which derivation started Fundamental dynamic participant in physics None. Resulting theory eradicates original flat geometry from all equations, showing it to be unobservable
View about the greatest single crisis of physics to emerge from these equations: complete gravitational collapse Central to understanding the nature of matter and the universe Unimportant or at most peripheral
Einstein derivation Spin-2 derivation Nature of primordial spacetime geometry? Not primordial; geometry is a dynamic participant in physics "God-given" flat Lorentz spacetime manifold Topology (multiple connectedness) of spacetime? Laws of physics are local; they do not specify the topology Simply connected Euclidean topology Vision of physics? Dynamic geometry is the "master field" of physics This field, that field, and the other field all execute their dynamics in a flatspacetime manifold Starting points for this derivation of general relativity "1. Equivalence principle (world lines of photons and test particles are geodesics of the spacetime geometry) 2. That tensorial conserved quantity which is derived from the curvature (Cartan's moment of rotation) is to be identified with the tensor of stress-momentum-energy (see Chapter 15)." "1. Begin with field of spin two and zero rest mass in flat spacetime. 2. Stress-energy tensor built from this field serves as a source for this field." Resulting equations Einstein's field equations Einstein's field equations Resulting assessment of the spacetime geometry from which derivation started Fundamental dynamic participant in physics None. Resulting theory eradicates original flat geometry from all equations, showing it to be unobservable View about the greatest single crisis of physics to emerge from these equations: complete gravitational collapse Central to understanding the nature of matter and the universe Unimportant or at most peripheral| | Einstein derivation | Spin-2 derivation | | :---: | :---: | :---: | | Nature of primordial spacetime geometry? | Not primordial; geometry is a dynamic participant in physics | "God-given" flat Lorentz spacetime manifold | | Topology (multiple connectedness) of spacetime? | Laws of physics are local; they do not specify the topology | Simply connected Euclidean topology | | Vision of physics? | Dynamic geometry is the "master field" of physics | This field, that field, and the other field all execute their dynamics in a flatspacetime manifold | | Starting points for this derivation of general relativity | 1. Equivalence principle (world lines of photons and test particles are geodesics of the spacetime geometry) <br> 2. That tensorial conserved quantity which is derived from the curvature (Cartan's moment of rotation) is to be identified with the tensor of stress-momentum-energy (see Chapter 15). | 1. Begin with field of spin two and zero rest mass in flat spacetime. <br> 2. Stress-energy tensor built from this field serves as a source for this field. | | Resulting equations | Einstein's field equations | Einstein's field equations | | Resulting assessment of the spacetime geometry from which derivation started | Fundamental dynamic participant in physics | None. Resulting theory eradicates original flat geometry from all equations, showing it to be unobservable | | View about the greatest single crisis of physics to emerge from these equations: complete gravitational collapse | Central to understanding the nature of matter and the universe | Unimportant or at most peripheral |
Thus G μ ν = R ¯ μ ν G μ ν = R ¯ μ ν G_(mu nu)= bar(R)_(mu nu)G_{\mu \nu}=\bar{R}_{\mu \nu}Gμν=R¯μν to first order in the h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν, and h ¯ μ ν = h μ ν h ¯ ¯ μ ν = h μ ν bar(bar(h))_(mu nu)=h_(mu nu)\overline{\bar{h}}_{\mu \nu}=h_{\mu \nu}h¯μν=hμν; i.e., h μ ν = h ¯ μ ν 1 2 η μ ν h ¯ h μ ν = h ¯ μ ν 1 2 η μ ν h ¯ h_(mu nu)= bar(h)_(mu nu)-(1)/(2)eta_(mu nu) bar(h)h_{\mu \nu}=\bar{h}_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} \bar{h}hμν=h¯μν12ημνh¯. With this notation the linearized field equations become
h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , ν α + h ¯ ν α , μ α = 16 π T μ ν h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , ν α + h ¯ ν α , μ α = 16 π T μ ν - bar(h)_(mu nu,alpha)^(alpha)-eta_(mu nu) bar(h)_(alpha beta,)^(alpha beta)+ bar(h)_(mu alpha,_(nu))^(alpha)+ bar(h)_(nu alpha,_(mu))^(alpha)=16 piT_(mu nu)-\bar{h}_{\mu \nu, \alpha}^{\alpha}-\eta_{\mu \nu} \bar{h}_{\alpha \beta,}{ }^{\alpha \beta}+\bar{h}_{\mu \alpha,{ }_{\nu}}^{\alpha}+\bar{h}_{\nu \alpha,{ }_{\mu}}^{\alpha}=16 \pi T_{\mu \nu}h¯μν,ααημνh¯αβ,αβ+h¯μα,να+h¯να,μα=16πTμν
(18.7) (3) linearized field equations
The first term in these linearized equations is the usual flat-space d'Alembertian, and the other terms serve merely to keep the equations "gauge-invariant" (see Box
18.2). In Box 18.2 it is shown that, without loss of generality, one can impose the "gauge conditions"
(18.8a) h ¯ μ α , α = 0 . (18.8a) h ¯ μ α , α = 0 . {:(18.8a) bar(h)^(mu alpha)_(,alpha)=0.:}\begin{equation*} \bar{h}^{\mu \alpha}{ }_{, \alpha}=0 . \tag{18.8a} \end{equation*}(18.8a)h¯μα,α=0.
These gauge conditions are the tensor analog of the Lorentz gauge A α , α = 0 A α , α = 0 A^(alpha)_(,alpha)=0A^{\alpha}{ }_{, \alpha}=0Aα,α=0 of electromagnetic theory. The field equations (18.7) then become
(5) field equations and metric in Lorentz gauge
(18.8b) h ¯ μ ν , α α = 16 π T μ ν (18.8b) h ¯ μ ν , α α = 16 π T μ ν {:(18.8b)- bar(h)_(mu nu,alpha)^(alpha)=16 piT_(mu nu):}\begin{equation*} -\bar{h}_{\mu \nu, \alpha}{ }^{\alpha}=16 \pi T_{\mu \nu} \tag{18.8b} \end{equation*}(18.8b)h¯μν,αα=16πTμν
The gauge conditions (18.8a), the field equations (18.8b), and the definition of the metric
(18.8c) g μ ν = η μ ν + h μ ν = η μ ν + h ¯ μ ν 1 2 η μ ν h ¯ (18.8c) g μ ν = η μ ν + h μ ν = η μ ν + h ¯ μ ν 1 2 η μ ν h ¯ {:(18.8c)g_(mu nu)=eta_(mu nu)+h_(mu nu)=eta_(mu nu)+ bar(h)_(mu nu)-(1)/(2)eta_(mu nu) bar(h):}\begin{equation*} g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}=\eta_{\mu \nu}+\bar{h}_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} \bar{h} \tag{18.8c} \end{equation*}(18.8c)gμν=ημν+hμν=ημν+h¯μν12ημνh¯
are the fundamental equations of the linearized theory of gravity written in Lorentz gauge.

EXERCISES

Exercise 18.1. GAUGE INVARIANCE OF THE RIEMANN CURVATURE

Show that in linearized theory the components of the Riemann tensor are
(18.9) R α μ β v = 1 2 ( h α v , μ β + h μ β , v α h μ v , α β h α β , μ v ) . (18.9) R α μ β v = 1 2 h α v , μ β + h μ β , v α h μ v , α β h α β , μ v . {:(18.9)R_(alpha mu beta v)=(1)/(2)(h_(alpha v,mu beta)+h_(mu beta,v alpha)-h_(mu v,alpha beta)-h_(alpha beta,mu v)).:}\begin{equation*} R_{\alpha \mu \beta v}=\frac{1}{2}\left(h_{\alpha v, \mu \beta}+h_{\mu \beta, v \alpha}-h_{\mu v, \alpha \beta}-h_{\alpha \beta, \mu v}\right) . \tag{18.9} \end{equation*}(18.9)Rαμβv=12(hαv,μβ+hμβ,vαhμv,αβhαβ,μv).
Then show that these components are left unchanged by a gauge transformation of the form discussed in Box 18.2 [equation (4b)]. Since the Einstein tensor is a contraction of the Riemann tensor, this shows that it is also gauge-invariant.

Exercise 18.2. JUSTIFICATION OF LORENTZ GAUGE

Let a particular solution to the field equations (18.7) of linearized theory be given, in an arbitrary gauge. Show that there necessarily exist four generating functions ξ μ ( t , x j ) ξ μ t , x j xi_(mu)(t,x^(j))\xi_{\mu}\left(t, x^{j}\right)ξμ(t,xj) whose gauge transformation [Box 18.2, eq. (4b)] makes
h ¯ new μ α , α = 0 (Lorentz gauge). h ¯ new  μ α , α = 0  (Lorentz gauge).  bar(h)^("new "mu alpha),alpha=0quad" (Lorentz gauge). "\bar{h}^{\text {new } \mu \alpha}, \alpha=0 \quad \text { (Lorentz gauge). }h¯new μα,α=0 (Lorentz gauge). 
Also show that a subsequent gauge transformation leaves this Lorentz gauge condition unaffected if and only if its generating functions satisfy the sourceless wave equation
ξ α , β β = 0 . ξ α , β β = 0 . xi^(alpha,beta)_(beta)=0.\xi^{\alpha, \beta}{ }_{\beta}=0 .ξα,ββ=0.

Exercise 18.3. EXTERNAL FIELD OF A STATIC, SPHERICAL BODY

Consider the external gravitational field of a static spherical body, as described in the body's (nearly) Lorentz frame-i.e., in a nearly rectangular coordinate system | h μ ν | 1 h μ ν 1 |h_(mu nu)|≪1\left|h_{\mu \nu}\right| \ll 1|hμν|1, in which the body is located at x = y = z = 0 x = y = z = 0 x=y=z=0x=y=z=0x=y=z=0 for all t t ttt. By fiat, adopt Lorentz gauge.
(a) Show that the field equations (18.8b) and gauge conditions (18.8a) imply
h ¯ 00 = 4 M / ( x 2 + y 2 + z 2 ) 1 / 2 , h ¯ 0 j = h ¯ j k = 0 , h 00 = h x x = h y y = h z z = 2 M / ( x 2 + y 2 + z 2 ) 1 / 2 , h α β = 0 if α β , h ¯ 00 = 4 M / x 2 + y 2 + z 2 1 / 2 , h ¯ 0 j = h ¯ j k = 0 , h 00 = h x x = h y y = h z z = 2 M / x 2 + y 2 + z 2 1 / 2 , h α β = 0  if  α β , {:[ bar(h)_(00)=4M//(x^(2)+y^(2)+z^(2))^(1//2)",", bar(h)_(0j)= bar(h)_(jk)=0","],[h_(00)=h_(xx)=h_(yy)=h_(zz)=2M//(x^(2)+y^(2)+z^(2))^(1//2)",",h_(alpha beta)=0" if "alpha!=beta","]:}\begin{array}{ll} \bar{h}_{00}=4 M /\left(x^{2}+y^{2}+z^{2}\right)^{1 / 2}, & \bar{h}_{0 j}=\bar{h}_{j k}=0, \\ h_{00}=h_{x x}=h_{y y}=h_{z z}=2 M /\left(x^{2}+y^{2}+z^{2}\right)^{1 / 2}, & h_{\alpha \beta}=0 \text { if } \alpha \neq \beta, \end{array}h¯00=4M/(x2+y2+z2)1/2,h¯0j=h¯jk=0,h00=hxx=hyy=hzz=2M/(x2+y2+z2)1/2,hαβ=0 if αβ,
where M M MMM is a constant (the mass of the body; see § 19.3 § 19.3 §19.3\S 19.3§19.3 ).

Box 18.2 GAUGE TRANSFORMATIONS AND COORDINATE TRANSFORMATIONS IN LINEARIZED THEORY

A. The Basic Equations of Linearized Theory, written in any coordinate system that is nearly globally Lorentz, are (18.1) and (18.7):
(1) g μ ν = η μ ν + h μ ν , | h μ ν | 1 ; (2) h ¯ μ ν , α α η μ ν h ¯ α β , , α β + h ¯ μ , , α ν + h ¯ ν α , μ = 16 π T μ ν . (1) g μ ν = η μ ν + h μ ν , h μ ν 1 ; (2) h ¯ μ ν , α α η μ ν h ¯ α β , , α β + h ¯ μ , , α ν + h ¯ ν α , μ = 16 π T μ ν . {:[(1)g_(mu nu)=eta_(mu nu)+h_(mu nu)","quad|h_(mu nu)|≪1;],[(2)- bar(h)_(mu nu,alpha)^(alpha)-eta_(mu nu) bar(h)_(alpha beta)","^(,alpha beta)+ bar(h)_(mu,)","alpha_(nu)+ bar(h)_(nu alpha)","_(mu)=16 piT_(mu nu).]:}\begin{gather*} g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}, \quad\left|h_{\mu \nu}\right| \ll 1 ; \tag{1}\\ -\bar{h}_{\mu \nu, \alpha}{ }^{\alpha}-\eta_{\mu \nu} \bar{h}_{\alpha \beta},{ }^{, \alpha \beta}+\bar{h}_{\mu,}, \alpha{ }_{\nu}+\bar{h}_{\nu \alpha},{ }_{\mu}=16 \pi T_{\mu \nu} . \tag{2} \end{gather*}(1)gμν=ημν+hμν,|hμν|1;(2)h¯μν,ααημνh¯αβ,,αβ+h¯μ,,αν+h¯να,μ=16πTμν.
Two different types of coordinate transformations connect nearly globally Lorentz systems to each other: global Lorentz transformations, and infinitesimal coordinate transformations.
  1. Global Lorentz Transformations:
(3a) x μ = Λ α μ x α , Λ μ α Λ ν β η μ ν = η α β . (3a) x μ = Λ α μ x α , Λ μ α Λ ν β η μ ν = η α β . {:(3a)x^(mu)=Lambda_(alpha^('))^(mu)x^(alpha^('))","quadLambda^(mu)_(alpha^('))Lambda^(nu)_(beta^('))eta_(mu nu)=eta_(alpha^(')beta^(')).:}\begin{equation*} x^{\mu}=\Lambda_{\alpha^{\prime}}^{\mu} x^{\alpha^{\prime}}, \quad \Lambda^{\mu}{ }_{\alpha^{\prime}} \Lambda^{\nu}{ }_{\beta^{\prime}} \eta_{\mu \nu}=\eta_{\alpha^{\prime} \beta^{\prime}} . \tag{3a} \end{equation*}(3a)xμ=Λαμxα,ΛμαΛνβημν=ηαβ.
These transform the metric coefficients via
η α β + h α β = g α β = x μ x α x ν x β g μ ν = Λ μ α α Λ β ( η μ ν + h μ ν ) = η α β + Λ μ α Λ β h μ ν . η α β + h α β = g α β = x μ x α x ν x β g μ ν = Λ μ α α Λ β η μ ν + h μ ν = η α β + Λ μ α Λ β h μ ν . {:[eta_(alpha^(')beta^('))+h_(alpha^(')beta^('))=g_(alpha^(')beta^('))=(delx^(mu))/(delx^(alpha^(')))(delx^(nu))/(delx^(beta^(')))g_(mu nu)=Lambda^(mu)alpha_(alpha^('))Lambda_(beta^('))^(')(eta_(mu nu)+h_(mu nu))],[=eta_(alpha^(')beta^('))+Lambda^(mu)_(alpha^('))Lambda^(')_(beta^('))h_(mu nu).]:}\begin{aligned} \eta_{\alpha^{\prime} \beta^{\prime}}+h_{\alpha^{\prime} \beta^{\prime}}=g_{\alpha^{\prime} \beta^{\prime}} & =\frac{\partial x^{\mu}}{\partial x^{\alpha^{\prime}}} \frac{\partial x^{\nu}}{\partial x^{\beta^{\prime}}} g_{\mu \nu}=\Lambda^{\mu} \alpha_{\alpha^{\prime}} \Lambda_{\beta^{\prime}}{ }^{\prime}\left(\eta_{\mu \nu}+h_{\mu \nu}\right) \\ & =\eta_{\alpha^{\prime} \beta^{\prime}}+\Lambda^{\mu}{ }_{\alpha^{\prime}} \Lambda^{\prime}{ }_{\beta^{\prime}} h_{\mu \nu} . \end{aligned}ηαβ+hαβ=gαβ=xμxαxνxβgμν=ΛμααΛβ(ημν+hμν)=ηαβ+ΛμαΛβhμν.
Thus, h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν-and likewise h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν-transform like components of a tensor in flat spacetime
(3b) h α β = Λ μ α Λ ν β h μ ν . (3b) h α β = Λ μ α Λ ν β h μ ν . {:(3b)h_(alpha^(')beta^('))=Lambda^(mu)_(alpha^('))Lambda^(nu)_(beta^('))h_(mu nu).:}\begin{equation*} h_{\alpha^{\prime} \beta^{\prime}}=\Lambda^{\mu}{ }_{\alpha^{\prime}} \Lambda^{\nu}{ }_{\beta^{\prime}} h_{\mu \nu} . \tag{3b} \end{equation*}(3b)hαβ=ΛμαΛνβhμν.
  1. Infinitesimal Coordinate Transformations (creation of "ripples" in the coordinate system):
(4a) x μ ( P ) = x μ ( P ) + ξ μ ( P ) , , (4a) x μ ( P ) = x μ ( P ) + ξ μ ( P ) , , {:(4a)x^(mu^(')(P)=x^(mu)(P)+xi^(mu)(P),,):}\begin{equation*} x^{\mu^{\prime}(\mathscr{P})=x^{\mu}(\mathscr{P})+\xi^{\mu}(\mathscr{P}), ~, ~} \tag{4a} \end{equation*}(4a)xμ(P)=xμ(P)+ξμ(P), , 
where ξ μ ( P ) ξ μ ( P ) xi^(mu)(P)\xi^{\mu}(\mathscr{P})ξμ(P) are four arbitrary functions small enough to leave | h μ ν | 1 h μ ν 1 |h_(mu^(')nu^('))|≪1\left|h_{\mu^{\prime} \nu^{\prime}}\right| \ll 1|hμν|1. Infinitesimal transformations of this sort make tiny changes in the functional forms of all scalar, vector, and tensor fields. Example: the temperature T T TTT is a unique function of position, T ( P ) T ( P ) T(P)T(\mathscr{P})T(P); so when written as a function of coordinates it changes
T ( x μ = a μ ) = T ( x μ + ξ μ = a μ ) = T ( x μ = a μ ξ μ ) = T ( x μ = a μ ) T , μ ξ μ ; T x μ = a μ = T x μ + ξ μ = a μ = T x μ = a μ ξ μ = T x μ = a μ T , μ ξ μ ; {:[T(x^(mu^('))=a^(mu))=T(x^(mu)+xi^(mu)=a^(mu))=T(x^(mu)=a^(mu)-xi^(mu))],[=T(x^(mu)=a^(mu))-T_(,mu)xi^(mu);]:}\begin{aligned} T\left(x^{\mu^{\prime}}=a^{\mu}\right) & =T\left(x^{\mu}+\xi^{\mu}=a^{\mu}\right)=T\left(x^{\mu}=a^{\mu}-\xi^{\mu}\right) \\ & =T\left(x^{\mu}=a^{\mu}\right)-T_{, \mu} \xi^{\mu} ; \end{aligned}T(xμ=aμ)=T(xμ+ξμ=aμ)=T(xμ=aμξμ)=T(xμ=aμ)T,μξμ;
i.e., if ξ 0 = 0.001 sin ( x 1 ) ξ 0 = 0.001 sin x 1 xi^(0)=0.001 sin(x^(1))\xi^{0}=0.001 \sin \left(x^{1}\right)ξ0=0.001sin(x1), and if T = cos 2 ( x 0 ) T = cos 2 x 0 T=cos^(2)(x^(0))T=\cos ^{2}\left(x^{0}\right)T=cos2(x0), then
T = cos 2 ( x 0 ) + 0.002 sin ( x 1 ) cos ( x 0 ) sin ( x 0 ) . T = cos 2 x 0 + 0.002 sin x 1 cos x 0 sin x 0 . T=cos^(2)(x^(0^(')))+0.002 sin(x^(1^(')))cos(x^(0^(')))sin(x^(0^('))).T=\cos ^{2}\left(x^{0^{\prime}}\right)+0.002 \sin \left(x^{1^{\prime}}\right) \cos \left(x^{0^{\prime}}\right) \sin \left(x^{0^{\prime}}\right) .T=cos2(x0)+0.002sin(x1)cos(x0)sin(x0).

Box 18.2 (continued)

These tiny changes can be ignored in all quantities except the metric, where tiny deviations from η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν contain all the information about gravity. The usual tensor transformation law for the metric
g ρ σ [ x α ( P ) ] = g μ ν [ x α ( P ) ] x μ x ρ x ν x σ , g ρ σ x α ( P ) = g μ ν x α ( P ) x μ x ρ x ν x σ , g_(rho^(')sigma^('))[x^({:alpha^(')(P)])=g_(mu nu)[x^(alpha)(P)](delx^(mu))/(delx^(rho^(')))(delx^(nu))/(delx^(sigma^('))),:}g_{\rho^{\prime} \sigma^{\prime}}\left[x^{\left.\alpha^{\prime}(\mathscr{P})\right]}=g_{\mu \nu}\left[x^{\alpha}(\mathscr{P})\right] \frac{\partial x^{\mu}}{\partial x^{\rho^{\prime}}} \frac{\partial x^{\nu}}{\partial x^{\sigma^{\prime}}},\right.gρσ[xα(P)]=gμν[xα(P)]xμxρxνxσ,
when combined with the transformation law (4a) and with
g μ ν [ x α ( P ) ] = η μ ν + h μ ν [ x α ( P ) ] , g μ ν x α ( P ) = η μ ν + h μ ν x α ( P ) , g_(mu nu)[x^(alpha)(P)]=eta_(mu nu)+h_(mu nu)[x^(alpha)(P)],g_{\mu \nu}\left[x^{\alpha}(\mathscr{P})\right]=\eta_{\mu \nu}+h_{\mu \nu}\left[x^{\alpha}(\mathscr{P})\right],gμν[xα(P)]=ημν+hμν[xα(P)],
reveals that
g ρ σ ( x α = a α ) = η ρ σ + h ρ σ ( x α = a α ) ξ ρ , σ ξ σ , ρ + negligible corrections h ρ σ , α ξ α and h ρ α ξ α , σ . g ρ σ x α = a α = η ρ σ + h ρ σ x α = a α ξ ρ , σ ξ σ , ρ +  negligible corrections  h ρ σ , α ξ α  and  h ρ α ξ α , σ . {:[g_(rho^(')sigma^('))(x^(alpha^('))=a^(alpha))=eta_(rho sigma)+h_(rho sigma)(x^(alpha)=a^(alpha))-xi_(rho,sigma)-xi_(sigma,rho)],[+" negligible corrections "∼h_(rho sigma,alpha)xi^(alpha)" and "∼h_(rho alpha)xi^(alpha)_(,sigma).]:}\begin{aligned} g_{\rho^{\prime} \sigma^{\prime}}\left(x^{\alpha^{\prime}}=a^{\alpha}\right)= & \eta_{\rho \sigma}+h_{\rho \sigma}\left(x^{\alpha}=a^{\alpha}\right)-\xi_{\rho, \sigma}-\xi_{\sigma, \rho} \\ & + \text { negligible corrections } \sim h_{\rho \sigma, \alpha} \xi^{\alpha} \text { and } \sim h_{\rho \alpha} \xi^{\alpha}{ }_{, \sigma} . \end{aligned}gρσ(xα=aα)=ηρσ+hρσ(xα=aα)ξρ,σξσ,ρ+ negligible corrections hρσ,αξα and hραξα,σ.
Hence, the metric perturbation functions in the new ( x μ x μ x^(mu^('))x^{\mu^{\prime}}xμ ) and old ( x μ x μ x^(mu)x^{\mu}xμ ) coordinate systems are related by
(4b) h μ ν new = h μ ν old ξ μ , ν ξ ν , μ ν , (4b) h μ ν new = h μ ν old ξ μ , ν ξ ν , μ ν , {:(4b)h_(mu nu)^(new)=h_(mu nu)^(old)-xi_(mu,nu)-xi_(nu,mu nu)",":}\begin{equation*} h_{\mu \nu}^{\mathrm{new}}=h_{\mu \nu}^{\mathrm{old}}-\xi_{\mu, \nu}-\xi_{\nu, \mu \nu}, \tag{4b} \end{equation*}(4b)hμνnew=hμνoldξμ,νξν,μν,
whereas the functional forms of all other scalars, vectors, and tensors are unaltered, to within the precision of linearized theory.
B. Gauge Transformations and Gauge Invariance. In linearized theory one usually regards equation (4b) as gauge transformations, analogous to those
(5a) A μ new = A μ old + Ψ , μ (5a) A μ new  = A μ old  + Ψ , μ {:(5a)A_(mu)^("new ")=A_(mu)^("old ")+Psi_(,mu):}\begin{equation*} A_{\mu}^{\text {new }}=A_{\mu}^{\text {old }}+\Psi_{, \mu} \tag{5a} \end{equation*}(5a)Aμnew =Aμold +Ψ,μ
of electromagnetic theory. The fact that gravitational gauge transformations do not affect the functional forms of scalars, vectors, or tensors (i.e., observables) is called "gauge invariance." Just as a straightforward calculation reveals the gauge invariance of the electromagnetic field,
(5b) F μ ν new = A ν , μ new A μ , ν new = A ν , μ old + Ψ , ν μ A μ , ν old Ψ , μ ν = F μ ν old , (5b) F μ ν new = A ν , μ new A μ , ν new = A ν , μ old  + Ψ , ν μ A μ , ν old  Ψ , μ ν = F μ ν old  , {:(5b)F_(mu nu)^(new)=A_(nu,mu)^(new)-A_(mu,nu)^(new)=A_(nu,mu)^("old ")+Psi_(,nu mu)-A_(mu,nu)^("old ")-Psi_(,mu nu)=F_(mu nu)^("old ")",":}\begin{equation*} F_{\mu \nu}^{\mathrm{new}}=A_{\nu, \mu}^{\mathrm{new}}-A_{\mu, \nu}^{\mathrm{new}}=A_{\nu, \mu}^{\text {old }}+\Psi_{, \nu \mu}-A_{\mu, \nu}^{\text {old }}-\Psi_{, \mu \nu}=F_{\mu \nu}^{\text {old }}, \tag{5b} \end{equation*}(5b)Fμνnew=Aν,μnewAμ,νnew=Aν,μold +Ψ,νμAμ,νold Ψ,μν=Fμνold ,
so a straightforward calculation (exercise 18.1) reveals the gauge invariance of the Riemann tensor
(6) R μ ν α β new = R μ ν α β old . (6) R μ ν α β new  = R μ ν α β old  . {:(6)R_(mu nu alpha beta)^("new ")=R_(mu nu alpha beta)^("old ").:}\begin{equation*} R_{\mu \nu \alpha \beta}^{\text {new }}=R_{\mu \nu \alpha \beta}^{\text {old }} . \tag{6} \end{equation*}(6)Rμναβnew =Rμναβold .
Such gauge invariance was already guaranteed by the fact that R μ ν α β R μ ν α β R_(mu nu alpha beta)R_{\mu \nu \alpha \beta}Rμναβ are the components of a tensor, and are thus essentially the same whether calculated in an orthonormal frame g μ ^ ν = η μ ν g μ ^ ν = η μ ν g_( hat(mu)nu)=eta_(mu nu)g_{\hat{\mu} \nu}=\eta_{\mu \nu}gμ^ν=ημν, in the old coordinates where g μ ν = η μ ν + h μ ν old g μ ν = η μ ν + h μ ν old  g_(mu nu)=eta_(mu nu)+h_(mu nu)^("old ")g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}^{\text {old }}gμν=ημν+hμνold , or in the new coordinates where g μ ν = η μ ν + h μ ν new g μ ν = η μ ν + h μ ν new  g_(mu nu)=eta_(mu nu)+h_(mu nu)^("new ")g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}^{\text {new }}gμν=ημν+hμνnew .
Like the Riemann tensor, the Einstein tensor and the stress-energy tensor are unaffected by gauge transformations. Hence, if one knows a specific solution h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν to the linearized field equations (2) for a given T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν, one can obtain another solution that describes precisely the same physical situation (all observables unchanged) by the change of gauge (4), in which ξ μ ξ μ xi_(mu)\xi_{\mu}ξμ are four arbitrary but small functions.
C. Lorentz Gauge. One can show (exercise 18.2) that for any physical situation, one can specialize the gauge (i.e., the coordinates) so that h ¯ μ α , α = 0 h ¯ μ α , α = 0 bar(h)^(mu alpha)_(,alpha)=0\bar{h}^{\mu \alpha}{ }_{, \alpha}=0h¯μα,α=0. This is the Lorentz gauge introduced in §18.1. The Lorentz gauge is not fixed uniquely. The gauge condition h ¯ μ α , α = 0 h ¯ μ α , α = 0 bar(h)^(mu alpha)_(,alpha)=0\bar{h}^{\mu \alpha}{ }_{, \alpha}=0h¯μα,α=0 is left unaffected by any gauge transformation for which
ξ α , β β = 0 . ξ α , β β = 0 . xi^(alpha,beta)_(beta)=0.\xi^{\alpha, \beta}{ }_{\beta}=0 .ξα,ββ=0.
(See exercise 18.2.)
D. Curvilinear Coordinate Systems. Once the gauge has been fixed by fiat for a given system (e.g., the solar system), one can regard h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν and h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν as components of tensors in flat spacetime; and one can regard the field equations (2) and the chosen gauge conditions as geometric, coordinate-independent equations in flat spacetime. This viewpoint allows one to use curvilinear coordinates (e.g., spherical coordinates centered on the sun), if one wishes. But in doing so, one must everywhere replace the Lorentz components of the metric, η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν, by the metric's components g μ v g μ v g_(mu v)g_{\mu v}gμv flat in the flat-spacetime curvilinear coordinate system; and one must replace all ordinary derivatives ("commas") in the field equations and gauge conditions by covariant derivatives whose connection coefficients come from g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν flat . See exercise 18.3 for an example.
(b) Adopt spherical polar coordinates,
x = r sin θ cos ϕ , y = r sin θ sin ϕ , z = r cos θ x = r sin θ cos ϕ , y = r sin θ sin ϕ , z = r cos θ x=r sin theta cos phi,quad y=r sin theta sin phi,quad z=r cos thetax=r \sin \theta \cos \phi, \quad y=r \sin \theta \sin \phi, \quad z=r \cos \thetax=rsinθcosϕ,y=rsinθsinϕ,z=rcosθ
By regarding h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν and h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν as components of tensors in flat spacetime (see end of Box 18.2), and by using the usual tensor transformation laws, put the solution found in (a) into the form
h ¯ 00 = 4 M / r , h ¯ 0 j = h ¯ j k = 0 , h 00 = 2 M r , h 0 j = 0 , h j k = 2 M r g j k nat h ¯ 00 = 4 M / r , h ¯ 0 j = h ¯ j k = 0 , h 00 = 2 M r , h 0 j = 0 , h j k = 2 M r g j k  nat  {:[ bar(h)_(00)=4M//r","quad bar(h)_(0j)= bar(h)_(jk)=0","],[h_(00)=(2M)/(r)","quadh_(0j)=0","quadh_(jk)=(2M)/(r)g_(jk)" nat "]:}\begin{gathered} \bar{h}_{00}=4 M / r, \quad \bar{h}_{0 j}=\bar{h}_{j k}=0, \\ h_{00}=\frac{2 M}{r}, \quad h_{0 j}=0, \quad h_{j k}=\frac{2 M}{r} g_{j k} \text { nat } \end{gathered}h¯00=4M/r,h¯0j=h¯jk=0,h00=2Mr,h0j=0,hjk=2Mrgjk nat 
where g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ flat are the components of the flat-spacetime metric in the spherical coordinate system
g 00 flat = 1 , g r r nat = 1 , g θ θ flat = r 2 , g ϕ ϕ fat = r 2 sin 2 θ , g α β = 0 when α β . g 00 flat  = 1 , g r r nat  = 1 , g θ θ flat  = r 2 , g ϕ ϕ fat  = r 2 sin 2 θ , g α β = 0  when  α β . {:[g_(00_("flat "))=-1","quadg_(rr_("nat "))=1","quadg_(theta theta_("flat "))=r^(2)","],[g_(phiphi_("fat "))=r^(2)sin^(2)theta","quadg_(alpha beta)=0" when "alpha!=beta.]:}\begin{gathered} g_{00_{\text {flat }}}=-1, \quad g_{r r_{\text {nat }}}=1, \quad g_{\theta \theta{ }_{\text {flat }}}=r^{2}, \\ g_{\phi \phi_{\text {fat }}}=r^{2} \sin ^{2} \theta, \quad g_{\alpha \beta}=0 \text { when } \alpha \neq \beta . \end{gathered}g00flat =1,grrnat =1,gθθflat =r2,gϕϕfat =r2sin2θ,gαβ=0 when αβ.
Thereby conclude that the general relativistic line element, accurate to linearized order, is
d s 2 = ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) ( d r 2 + r 2 d θ 2 + r 2 sin 2 θ d ϕ 2 ) d s 2 = ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) d r 2 + r 2 d θ 2 + r 2 sin 2 θ d ϕ 2 ds^(2)=-(1-2M//r)dt^(2)+(1+2M//r)(dr^(2)+r^(2)dtheta^(2)+r^(2)sin^(2)theta dphi^(2))d s^{2}=-(1-2 M / r) d t^{2}+(1+2 M / r)\left(d r^{2}+r^{2} d \theta^{2}+r^{2} \sin ^{2} \theta d \phi^{2}\right)ds2=(12M/r)dt2+(1+2M/r)(dr2+r2dθ2+r2sin2θdϕ2)
(c) Derive this general, static, spherically symmetric, Lorentz-gauge, vacuum solution to the linearized field equations from scratch, working entirely in spherical coordinates. [Hint: As discussed at the end of Box 18.2, η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν in equation (18.8c) must be replaced by g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν fat ; and in the field equations and gauge conditions ( 18.8 a , b 18.8 a , b 18.8a,b18.8 \mathrm{a}, \mathrm{b}18.8a,b ), all commas (partial derivatives) must be replaced by covariant derivatives, whose connection coefficients come from g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν flat .] (d) Calculate the Riemann curvature tensor for this gravitational field. The answer should agree with equation (1.14).
Linearized theory and electromagnetic theory compared
How to analyze effects of weak gravity on matter

§18.2. GRAVITATIONAL WAVES

The gauge conditions and field equations (18.8a, b) of linearized theory bear a close resemblance to the equations of electromagnetic theory in Lorentz gauge and flat spacetime,
(18.10a) A , α α = 0 (18.10b) A , α μ α = 4 π J μ (18.10a) A , α α = 0 (18.10b) A , α μ α = 4 π J μ {:[(18.10a)A_(,alpha)^(alpha)=0],[(18.10b)-A_(,alpha)^(mu)^(alpha)=4piJ^(mu)]:}\begin{align*} A_{, \alpha}^{\alpha} & =0 \tag{18.10a}\\ -A_{, \alpha}^{\mu}{ }^{\alpha} & =4 \pi J^{\mu} \tag{18.10b} \end{align*}(18.10a)A,αα=0(18.10b)A,αμα=4πJμ
They differ only in the added index ( h μ ν h μ ν h^(mu nu)h^{\mu \nu}hμν versus A μ , T μ ν A μ , T μ ν A^(mu),T^(mu nu)A^{\mu}, T^{\mu \nu}Aμ,Tμν versus J μ J μ J^(mu)J^{\mu}Jμ ). Consequently, from past experience with electromagnetic theory, one can infer much about linearized gravitation theory.
For example, the field equations (18.8b) must have gravitational-wave solutions. The analog of the electromagnetic plane wave
A x = A x ( t z ) , A y = A y ( t z ) , A z = 0 , A 0 = 0 A x = A x ( t z ) , A y = A y ( t z ) , A z = 0 , A 0 = 0 A^(x)=A^(x)(t-z),quadA^(y)=A^(y)(t-z),quadA^(z)=0,quadA^(0)=0A^{x}=A^{x}(t-z), \quad A^{y}=A^{y}(t-z), \quad A^{z}=0, \quad A^{0}=0Ax=Ax(tz),Ay=Ay(tz),Az=0,A0=0
will be the gravitational plane wave
h ¯ x x = h ¯ x x ( t z ) , h ¯ x y = h ¯ x y ( t z ) , h ¯ y y = h ¯ y y ( t z ) , (18.11) h ¯ μ 0 = h ¯ μ z = 0 for all μ . h ¯ x x = h ¯ x x ( t z ) , h ¯ x y = h ¯ x y ( t z ) , h ¯ y y = h ¯ y y ( t z ) , (18.11) h ¯ μ 0 = h ¯ μ z = 0  for all  μ . {:[ bar(h)^(xx)= bar(h)^(xx)(t-z)","quad bar(h)^(xy)= bar(h)^(xy)(t-z)","quad bar(h)^(yy)= bar(h)^(yy)(t-z)","],[(18.11) bar(h)^(mu0)= bar(h)^(mu z)=0" for all "mu.]:}\begin{gather*} \bar{h}^{x x}=\bar{h}^{x x}(t-z), \quad \bar{h}^{x y}=\bar{h}^{x y}(t-z), \quad \bar{h}^{y y}=\bar{h}^{y y}(t-z), \\ \bar{h}^{\mu 0}=\bar{h}^{\mu z}=0 \text { for all } \mu . \tag{18.11} \end{gather*}h¯xx=h¯xx(tz),h¯xy=h¯xy(tz),h¯yy=h¯yy(tz),(18.11)h¯μ0=h¯μz=0 for all μ.
Although a detailed study of such waves will be delayed until Chapters 35-37, some properties of these waves are explored in the exercises at the end of the next section.

§18.3. EFFECT OF GRAVITY ON MATTER

The effects of weak gravitational fields on matter can be computed by using the linearized metric (18.1) and Christoffel symbols (18.2) in the appropriate equations of motion-i.e., in the geodesic equation (for the motion of particles or light rays), in the hydrodynamic equations (for fluid matter), in Maxwell's equations (for electromagnetic waves), or in the equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 for the total stress-energy tensor
of whatever fields and matter may be present. Exercises 18.5, 18.6 and 18.7 provide examples, as do the Newtonian-limit calculations in exercises 16.1 and 16.4, and in §17.4. If, however, the lowest-order (linearized) gravitational "forces" (Chris-toffel-symbol terms) have a significant influence on the motion of the sources of the gravitational field, one finds that the linearized field equation (18.7) is inadequate, and better approximations to Einstein's equations must be considered. [Thus emission of gravitational waves by a mechanically or electrically driven oscillator falls within the scope of linearized theory, but emission by a double-star system, or by stellar oscillations that gravitational forces maintain, will require discussion of nonlinear terms (gravitational "stress-energy") in the Einstein equations; see § § 36.9 § § 36.9 §§36.9\S \S 36.9§§36.9 to 36.11.]
The above conclusions follow from a consideration of conservation laws associated with the linearized field equation. Just as the electromagnetic equations (18.10a, b) guarantee charge conservation
J μ , μ = 0 , all space J 0 ( t , x ) d x d y d z Q = const , J μ , μ = 0 , all space  J 0 ( t , x ) d x d y d z Q =  const  , J^(mu)_(,mu)=0,quadint_("all space ")J^(0)(t,x)dxdydz-=Q=" const ",J^{\mu}{ }_{, \mu}=0, \quad \int_{\text {all space }} J^{0}(t, \boldsymbol{x}) d x d y d z \equiv Q=\text { const },Jμ,μ=0,all space J0(t,x)dxdydzQ= const ,
so the gravitational equations ( 18.8 ( 18.8 (18.8(18.8(18.8 a, b) guarantee conservation of the total 4 -momentum and angular momentum of any body bounded by vacuum:
(18.12a) T μ ν , ν = 0 (18.12b) body T μ 0 ( t , x ) d x d y d z P μ = const ; (18.13a) ( x α T β μ x β T α μ ) , μ = 0 , (18.13b) body ( x α T β 0 x β T α 0 ) d x d y d z J α β = const. (18.12a) T μ ν , ν = 0 (18.12b) body  T μ 0 ( t , x ) d x d y d z P μ =  const  ; (18.13a) x α T β μ x β T α μ , μ = 0 , (18.13b) body  x α T β 0 x β T α 0 d x d y d z J α β =  const.  {:[(18.12a)T^(mu nu)_(,nu)=0],[(18.12b)int_("body ")T^(mu0)(t","x)dxdydz-=P^(mu)=" const ";],[(18.13a)(x^(alpha)T^(beta mu)-x^(beta)T^(alpha mu))","mu=0","],[(18.13b)int_("body ")(x^(alpha)T^(beta0)-x^(beta)T^(alpha0))dxdydz-=J^(alpha beta)=" const. "]:}\begin{gather*} T^{\mu \nu}{ }_{, \nu}=0 \tag{18.12a}\\ \int_{\text {body }} T^{\mu 0}(t, x) d x d y d z \equiv P^{\mu}=\text { const } ; \tag{18.12b}\\ \left(x^{\alpha} T^{\beta \mu}-x^{\beta} T^{\alpha \mu}\right), \mu=0, \tag{18.13a}\\ \int_{\text {body }}\left(x^{\alpha} T^{\beta 0}-x^{\beta} T^{\alpha 0}\right) d x d y d z \equiv J^{\alpha \beta}=\text { const. } \tag{18.13b} \end{gather*}(18.12a)Tμν,ν=0(18.12b)body Tμ0(t,x)dxdydzPμ= const ;(18.13a)(xαTβμxβTαμ),μ=0,(18.13b)body (xαTβ0xβTα0)dxdydzJαβ= const. 
(See § 5.11 § 5.11 §5.11\S 5.11§5.11 for the basic properties of angular momentum in special relativity. The angular momentum here is calculated relative to the origin of the coordinate system.) Now it is important that the stress-energy components T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν, which appear in the linearized field equations (18.7) and in these conservation laws, are precisely the components one would calculate using special relativity (with g μ ν = η μ ν g μ ν = η μ ν g_(mu nu)=eta_(mu nu)g_{\mu \nu}=\eta_{\mu \nu}gμν=ημν ). As a result, the energy-momentum conservation formulated here contains no contributions or effects of gravity! From this one sees that linearized theory assumes that gravitational forces do no significant work. For example, energy losses due to gravitational radiation-damping forces are neglected by linearized theory. Similarly, conservation of 4-momentum P μ P μ P^(mu)P^{\mu}Pμ for each of the bodies acting as sources of h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν means that each body moves along a geodesic of η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν (straight lines in the nearly Lorentz coordinate system) rather than along a geodesic of g μ ν = η μ ν + h μ ν g μ ν = η μ ν + h μ ν g_(mu nu)=eta_(mu nu)+h_(mu nu)g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}gμν=ημν+hμν. Thus, linearized theory can be used to calculate the motion of test particles and fields, using g μ ν = η μ ν + h μ ν g μ ν = η μ ν + h μ ν g_(mu nu)=eta_(mu nu)+h_(mu nu)g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}gμν=ημν+hμν; but to include gravitational corrections to the motion of the sources themselves-to allow them to satisfy T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0 rather than T μ ν , ν = 0 T μ ν , ν = 0 T^(mu nu)_(,nu)=0T^{\mu \nu}{ }_{, \nu}=0Tμν,ν=0-one must reinsert into the field equations the nonlinear terms that linearized theory discards. (See, e.g., Chapter 20 on conservation laws; § § 36.9 36.11 § § 36.9 36.11 §§36.9-36.11\S \S 36.9-36.11§§36.936.11 on the generation of gravitational waves and radiation reaction; and Chapter 39 on the post-Newtonian approximation.)
Conservation of 4-momentum and angular momentum in linearized theory
Limit on validity of linearized theory: gravity must not affect motions of sources significantly
The energy, momentum, and angular momentum radiated by gravitational waves in linearized theory can be calculated by special-relativistic methods analogous to those used in electromagnetic theory for electromagnetic waves [Fiertz and Pauli (1939)], but it will be more informative and powerful to use a fully gravitational approach (Chapters 35 and 36 ).

EXERCISES

Exercise 18.4. SPACETIME CURVATURE FOR A PLANE GRAVITATIONAL WAVE

Calculate the components of the Riemann curvature tensor [equations (18.9)] for the gravitational plane wave (18.11). [Answer:
R x 0 x 0 = R y 0 y 0 = R x 0 x z = + R y 0 y z = + R x z x z = R y z y z = 1 4 ( h ¯ x x h ¯ y y ) , t R x 0 y 0 = R x 0 y z = + R x z y z = R x z y 0 = 1 2 h ¯ x y , t t R x 0 x 0 = R y 0 y 0 = R x 0 x z = + R y 0 y z = + R x z x z = R y z y z = 1 4 h ¯ x x h ¯ y y , t R x 0 y 0 = R x 0 y z = + R x z y z = R x z y 0 = 1 2 h ¯ x y , t t {:[R_(x0x0)=-R_(y0y0)=-R_(x0xz)=+R_(y0yz)=+R_(xzxz)=-R_(yzyz)=-(1)/(4)( bar(h)_(xx)- bar(h)_(yy))_(,t)],[R_(x0y0)=-R_(x0yz)=+R_(xzyz)=-R_(xzy0)=-(1)/(2) bar(h)_(xy,tt)]:}\begin{aligned} & R_{x 0 x 0}=-R_{y 0 y 0}=-R_{x 0 x z}=+R_{y 0 y z}=+R_{x z x z}=-R_{y z y z}=-\frac{1}{4}\left(\bar{h}_{x x}-\bar{h}_{y y}\right)_{, t} \\ & R_{x 0 y 0}=-R_{x 0 y z}=+R_{x z y z}=-R_{x z y 0}=-\frac{1}{2} \bar{h}_{x y, t t} \end{aligned}Rx0x0=Ry0y0=Rx0xz=+Ry0yz=+Rxzxz=Ryzyz=14(h¯xxh¯yy),tRx0y0=Rx0yz=+Rxzyz=Rxzy0=12h¯xy,tt
all other components vanish except those obtainable from the above by the symmetries R α β γ δ = R [ α β ] [ y δ ] = R γ δ α β R α β γ δ = R [ α β ] [ y δ ] = R γ δ α β R_(alpha beta gamma delta)=R_([alpha beta][y delta])=R_(gamma delta alpha beta)R_{\alpha \beta \gamma \delta}=R_{[\alpha \beta][y \delta]}=R_{\gamma \delta \alpha \beta}Rαβγδ=R[αβ][yδ]=Rγδαβ.

Exercise 18.5. A PRIMITIVE GRAVITATIONAL-WAVE DETECTOR (see Figure 18.1)

Two beads slide almost freely on a smooth stick; only slight friction impedes their sliding. The stick falls freely through spacetime, with its center moving along a geodesic and its ends attached to gyroscopes, so they do not rotate. The beads are positioned equidistant (distance 1 2 1 2 (1)/(2)ℓ\frac{1}{2} \ell12 ) from the stick's center. Plane gravitational waves [equation (18.11) and exercise 18.4], impinging on the stick, push the beads back and forth ("geodesic deviation"; "tidal gravitational forces"). The resultant friction of beads on stick heats the stick; and the passage of the waves is detected by measuring the rise in stick temperature.* (Of course, this is not the best of all conceivable designs!) Neglecting the effect of friction on the beads' motion, calculate the proper distance separating them as a function of time. [Hints: Let ξ ξ xi\boldsymbol{\xi}ξ be the separation between the beads; and let n = ξ / | ξ | n = ξ / | ξ | n=xi//|xi|\boldsymbol{n}=\boldsymbol{\xi} /|\boldsymbol{\xi}|n=ξ/|ξ| be a unit vector that points along the stick in the stick's own rest frame. Then their separation has magnitude = ξ n = ξ n ℓ=xi*n\ell=\boldsymbol{\xi} \cdot \boldsymbol{n}=ξn. The fact that the stick is nonrotating is embodied in a parallel-transport law for n , u n = 0 n , u n = 0 n,grad_(u)n=0\boldsymbol{n}, \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{n}=0n,un=0. ("FermiWalker transport" of § § 6.5 , 6.6 § § 6.5 , 6.6 §§6.5,6.6\S \S 6.5,6.6§§6.5,6.6, and 13.6 reduces to parallel transport, because the stick moves along a geodesic with a = u u = 0 a = u u = 0 a=grad_(u)u=0\boldsymbol{a}=\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=0a=uu=0.) Thus,
d / d τ = u ( ξ n ) = ( u ξ ) n d 2 / d τ 2 = u u ( ξ n ) = ( u u ξ ) n d / d τ = u ( ξ n ) = u ξ n d 2 / d τ 2 = u u ( ξ n ) = u u ξ n {:[dℓ//d tau=grad_(u)(xi*n)=(grad_(u)xi)*n],[d^(2)ℓ//dtau^(2)=grad_(u)grad_(u)(xi*n)=(grad_(u)grad_(u)xi)*n]:}\begin{aligned} d \ell / d \tau & =\boldsymbol{\nabla}_{u}(\xi \cdot \boldsymbol{n})=\left(\boldsymbol{\nabla}_{u} \boldsymbol{\xi}\right) \cdot \boldsymbol{n} \\ d^{2} \ell / d \tau^{2} & =\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{\nabla}_{u}(\xi \cdot \boldsymbol{n})=\left(\boldsymbol{\nabla}_{u} \boldsymbol{\nabla}_{u} \xi\right) \cdot \boldsymbol{n} \end{aligned}d/dτ=u(ξn)=(uξ)nd2/dτ2=uu(ξn)=(uuξ)n
where τ τ tau\tauτ is the stick's proper time. But u u ξ u u ξ grad_(u)grad_(u)xi\nabla_{\boldsymbol{u}} \boldsymbol{\nabla}_{\boldsymbol{u}} \xiuuξ is produced by the Riemann curvature of the wave (geodesic deviation):
u u ξ = projection along n of [ R i e m a n n ( , u , ξ , u ) ] . u u ξ =  projection along  n  of  [ R i e m a n n ( , u , ξ , u ) ] grad_(u)grad_(u)xi=" projection along "n" of "[-Riemann(dots,u,xi,u)]". "\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{\nabla}_{u} \xi=\text { projection along } \boldsymbol{n} \text { of }[-\boldsymbol{\operatorname { R i e m a n n }}(\ldots, \boldsymbol{u}, \xi, \boldsymbol{u})] \text {. }uuξ= projection along n of [Riemann(,u,ξ,u)]
(The geodesic-deviation forces perpendicular to the stick, i.e., perpendicular to n n n\boldsymbol{n}n, are coun-
Figure 18.1.
A primitive detector for gravitational waves, consisting of a beaded stick with gyroscopes on its ends [Bondi (1957)]. See exercise 18.5 for discussion.
terbalanced by the stick's pushing back on the beads to stop them from passing through it-no penetration of matter by matter!) Thus,
d 2 / d τ 2 = R i e m a n n ( , u , ξ , u ) n = R i e m a n n ( n , u , ξ , u ) d 2 / d τ 2 = R i e m a n n ( , u , ξ , u ) n = R i e m a n n ( n , u , ξ , u ) d^(2)ℓ//dtau^(2)=-Riemann(dots,u,xi,u)*n=-Riemann(n,u,xi,u)d^{2} \ell / d \tau^{2}=-\boldsymbol{R i e m a n n}(\ldots, \boldsymbol{u}, \boldsymbol{\xi}, \boldsymbol{u}) \cdot \boldsymbol{n}=-\boldsymbol{R i e m a n n}(\boldsymbol{n}, \boldsymbol{u}, \boldsymbol{\xi}, \boldsymbol{u})d2/dτ2=Riemann(,u,ξ,u)n=Riemann(n,u,ξ,u)
Evaluate this acceleration in the stick's local Lorentz frame. Orient the coordinates so the waves propagate in the z z zzz-direction and the stick's direction has components n z = cos θ n z = cos θ n^(z)=cos thetan^{z}=\cos \thetanz=cosθ, n x = sin θ cos ϕ , n y = sin θ sin ϕ n x = sin θ cos ϕ , n y = sin θ sin ϕ n^(x)=sin theta cos phi,n^(y)=sin theta sin phin^{x}=\sin \theta \cos \phi, n^{y}=\sin \theta \sin \phinx=sinθcosϕ,ny=sinθsinϕ. Solve the resulting differential equation for ( τ ) ( τ ) ℓ(tau)\ell(\tau)(τ).] [Answer:
= 0 [ 1 + 1 4 ( h ¯ x x h ¯ y y ) sin 2 θ cos 2 ϕ + 1 2 h ¯ x y sin 2 θ sin 2 ϕ ] , = 0 1 + 1 4 h ¯ x x h ¯ y y sin 2 θ cos 2 ϕ + 1 2 h ¯ x y sin 2 θ sin 2 ϕ , ℓ=ℓ_(0)[1+(1)/(4)( bar(h)_(xx)- bar(h)_(yy))sin^(2)theta cos 2phi+(1)/(2) bar(h)_(xy)sin^(2)theta sin 2phi],\ell=\ell_{0}\left[1+\frac{1}{4}\left(\bar{h}_{x x}-\bar{h}_{y y}\right) \sin ^{2} \theta \cos 2 \phi+\frac{1}{2} \bar{h}_{x y} \sin ^{2} \theta \sin 2 \phi\right],=0[1+14(h¯xxh¯yy)sin2θcos2ϕ+12h¯xysin2θsin2ϕ],
where h ¯ j k h ¯ j k bar(h)_(jk)\bar{h}_{j k}h¯jk are evaluated on the stick's world line ( x = y = z = 0 ) ( x = y = z = 0 ) (x=y=z=0)(x=y=z=0)(x=y=z=0). Notice that, if the stick is oriented along the direction of wave propagation (if θ = 0 θ = 0 theta=0\theta=0θ=0 ), the beads do not move. In this sense, the effect of the waves (geodesic deviation) is purely transverse. For further discussion, see § § 35.4 § § 35.4 §§35.4\S \S 35.4§§35.4 to 35.6 .]

§18.4. NEARLY NEWTONIAN GRAVITATIONAL FIELDS

The general solution to the linearized field equations in Lorentz gauge [equations ( 18.8 a , b ) ] ( 18.8 a , b ) ] (18.8a,b)](18.8 \mathrm{a}, \mathrm{b})](18.8a,b)] lends itself to expression as a retarded integral of the form familiar from electromagnetic theory:
(18.14) h ¯ μ ν ( t , x ) = 4 T μ ν ( t | x x | , x ) | x x | d 3 x (18.14) h ¯ μ ν ( t , x ) = 4 T μ ν t x x , x x x d 3 x {:(18.14) bar(h)_(mu nu)(t","x)=int(4T_(mu nu)(t-|x-x^(')|,x^(')))/(|x-x^(')|)d^(3)x^('):}\begin{equation*} \bar{h}_{\mu \nu}(t, x)=\int \frac{4 T_{\mu \nu}\left(t-\left|x-x^{\prime}\right|, x^{\prime}\right)}{\left|x-x^{\prime}\right|} d^{3} x^{\prime} \tag{18.14} \end{equation*}(18.14)h¯μν(t,x)=4Tμν(t|xx|,x)|xx|d3x
The gravitational-wave aspects of this solution will be studied in Chapter 36. Here focus attention on a nearly Newtonian source: T 00 | T 0 j | , T 00 | T j k | T 00 T 0 j , T 00 T j k T_(00)≫|T_(0j)|,T_(00)≫|T_(jk)|T_{00} \gg\left|T_{0 j}\right|, T_{00} \gg\left|T_{j k}\right|T00|T0j|,T00|Tjk|, and velocities slow enough that retardation is negligible. In this case, (18.14) reduces to
(18.15a) h ¯ 00 = 4 Φ , h ¯ 0 j = h ¯ j k = 0 (18.15b) Φ ( t , x ) = T 00 ( t , x ) | x x | d 3 x = Newtonian potential. (18.15a) h ¯ 00 = 4 Φ , h ¯ 0 j = h ¯ j k = 0 (18.15b) Φ ( t , x ) = T 00 t , x x x d 3 x =  Newtonian potential.  {:[(18.15a) bar(h)_(00)=-4Phi","quad bar(h)_(0j)= bar(h)_(jk)=0],[(18.15b)Phi(t","x)=-int(T_(00)(t,x^(')))/(|x-x^(')|)d^(3)x^(')=" Newtonian potential. "]:}\begin{gather*} \bar{h}_{00}=-4 \Phi, \quad \bar{h}_{0 j}=\bar{h}_{j k}=0 \tag{18.15a}\\ \Phi(t, \boldsymbol{x})=-\int \frac{T_{00}\left(t, \boldsymbol{x}^{\prime}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}=\text { Newtonian potential. } \tag{18.15b} \end{gather*}(18.15a)h¯00=4Φ,h¯0j=h¯jk=0(18.15b)Φ(t,x)=T00(t,x)|xx|d3x= Newtonian potential. 
The corresponding metric ( 18.8 c ) ( 18.8 c ) (18.8c)(18.8 \mathrm{c})(18.8c) is
(18.15c) d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) ( d x 2 + d y 2 + d z 2 ) ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) ( d x 2 + d y 2 + d z 2 ) far from source. (18.15c) d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) d x 2 + d y 2 + d z 2 ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) d x 2 + d y 2 + d z 2  far from source.  {:[(18.15c)ds^(2)=-(1+2Phi)dt^(2)+(1-2Phi)(dx^(2)+dy^(2)+dz^(2))],[~~-(1-2M//r)dt^(2)+(1+2M//r)(dx^(2)+dy^(2)+dz^(2))" far from source. "]:}\begin{align*} d s^{2} & =-(1+2 \Phi) d t^{2}+(1-2 \Phi)\left(d x^{2}+d y^{2}+d z^{2}\right) \tag{18.15c}\\ & \approx-(1-2 M / r) d t^{2}+(1+2 M / r)\left(d x^{2}+d y^{2}+d z^{2}\right) \text { far from source. } \end{align*}(18.15c)ds2=(1+2Φ)dt2+(12Φ)(dx2+dy2+dz2)(12M/r)dt2+(1+2M/r)(dx2+dy2+dz2) far from source. 
Retarded-integral solution of linearized field equation
Newtonian gravity as a limit of linearized theory
Bending of light and gravitational redshift predicted by linearized theory
The errors in this metric are: (1) missing corrections of order Φ 2 Φ 2 Phi^(2)\Phi^{2}Φ2 due to nonlinearities of which linearized theory is oblivious; (2) missing corrections due to setting h ¯ 0 j = 0 h ¯ 0 j = 0 bar(h)_(0j)=0\bar{h}_{0 j}=0h¯0j=0 (these are of order h ¯ 0 j Φ v h ¯ 0 j Φ v bar(h)_(0j)∼Phi v\bar{h}_{0 j} \sim \Phi vh¯0jΦv, where v | T 0 j | / T 00 v T 0 j / T 00 v∼|T_(0j)|//T_(00)v \sim\left|T_{0 j}\right| / T_{00}v|T0j|/T00 is a typical velocity in the source); (3) missing corrections due to setting h ¯ j k = 0 h ¯ j k = 0 bar(h)_(jk)=0\bar{h}_{j k}=0h¯jk=0 [these are of order h ¯ j k Φ ( | T j k | / T 00 ) ] h ¯ j k Φ T j k / T 00 {: bar(h)_(jk)∼Phi(|T_(jk)|//T_(00))]\left.\bar{h}_{j k} \sim \Phi\left(\left|T_{j k}\right| / T_{00}\right)\right]h¯jkΦ(|Tjk|/T00)]. In the solar system all these errors are 10 12 10 12 ∼10^(-12)\sim 10^{-12}1012, whereas Φ 10 6 Φ 10 6 Phi∼10^(-6)\Phi \sim 10^{-6}Φ106.
Passive correspondence with Newtonian theory demanded only that g 00 = g 00 = g_(00)=g_{00}=g00= ( 1 + 2 Φ ) ( 1 + 2 Φ ) -(1+2Phi)-(1+2 \Phi)(1+2Φ); see equation (17.19). However, linearized theory determines all the metric coefficients, up to errors of Φ v , Φ 2 Φ v , Φ 2 ∼Phi v,∼Phi^(2)\sim \Phi v, \sim \Phi^{2}Φv,Φ2, and Φ ( | T j k | / T 00 ) Φ T j k / T 00 ∼Phi(|T_(jk)|//T_(00))\sim \boldsymbol{\Phi}\left(\left|T_{j k}\right| / T_{00}\right)Φ(|Tjk|/T00). This is sufficient accuracy to predict correctly (fractional errors 10 6 10 6 ∼10^(-6)\sim 10^{-6}106 ) the bending of light and the gravitational redshift in the solar system, but not perihelion shifts.

EXERCISES

Exercise 18.6. BENDING OF LIGHT BY THE SUN

To high precision, the sun is static and spherical, so its external line element is ( 18.15 c ) ( 18.15 c ) (18.15c)(18.15 \mathrm{c})(18.15c) with Φ = M / r Φ = M / r Phi=-M//r\Phi=-M / rΦ=M/r; i.e.,
d s 2 = ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) ( d x 2 + d y 2 + d z 2 ) d s 2 = ( 1 2 M / r ) d t 2 + ( 1 + 2 M / r ) d x 2 + d y 2 + d z 2 ds^(2)=-(1-2M//r)dt^(2)+(1+2M//r)(dx^(2)+dy^(2)+dz^(2))d s^{2}=-(1-2 M / r) d t^{2}+(1+2 M / r)\left(d x^{2}+d y^{2}+d z^{2}\right)ds2=(12M/r)dt2+(1+2M/r)(dx2+dy2+dz2) everywhere outside sun.
A photon moving in the equatorial plane ( z = 0 ) ( z = 0 ) (z=0)(z=0)(z=0) of this curved spacetime gets deflected very slightly from the world line
(18.17) x = t , y = b "impact parameter," z = 0 (18.17) x = t , y = b  "impact parameter,"  z = 0 {:(18.17)x=t","quad y=b-=" "impact parameter," "quad z=0:}\begin{equation*} x=t, \quad y=b \equiv \text { "impact parameter," } \quad z=0 \tag{18.17} \end{equation*}(18.17)x=t,y=b "impact parameter," z=0
Calculate the amount of deflection as follows.
(a) Write down the geodesic equation (16.4a) for the photon's world line,
(18.18) d p α d λ + Γ α β γ p β p γ = 0 (18.18) d p α d λ + Γ α β γ p β p γ = 0 {:(18.18)(dp^(alpha))/(dlambda^(**))+Gamma^(alpha)_(beta gamma)p^(beta)p^(gamma)=0:}\begin{equation*} \frac{d p^{\alpha}}{d \lambda^{*}}+\Gamma^{\alpha}{ }_{\beta \gamma} p^{\beta} p^{\gamma}=0 \tag{18.18} \end{equation*}(18.18)dpαdλ+Γαβγpβpγ=0
[Here p = d / d λ = ( 4 p = d / d λ = ( 4 p=d//dlambda^(**)=(4-\boldsymbol{p}=d / d \lambda^{*}=(4-p=d/dλ=(4 momentum of photon ) = ( ) = ( )=()=()=( tangent vector to photon's null geodesic).]
(b) By evaluating the connection coefficients in the equatorial plane, and by using the approximate values, | p y | p 0 p x p y p 0 p x |p^(y)|≪p^(0)~~p^(x)\left|p^{y}\right| \ll p^{0} \approx p^{x}|py|p0px, of the 4 -momentum components corresponding to the approximate world line (18.17), show that
d p y d λ = 2 M b ( x 2 + b 2 ) 3 / 2 p x d x d λ , p x = p 0 [ 1 + O ( M b ) ] = const [ 1 + O ( M b ) ] d p y d λ = 2 M b x 2 + b 2 3 / 2 p x d x d λ , p x = p 0 1 + O M b = const 1 + O M b (dp^(y))/(dlambda^(**))=(-2Mb)/((x^(2)+b^(2))^(3//2))p^(x)(dx)/(dlambda^(**)),quadp^(x)=p^(0)[1+O((M)/(b))]=const[1+O((M)/(b))]\frac{d p^{y}}{d \lambda^{*}}=\frac{-2 M b}{\left(x^{2}+b^{2}\right)^{3 / 2}} p^{x} \frac{d x}{d \lambda^{*}}, \quad p^{x}=p^{0}\left[1+O\left(\frac{M}{b}\right)\right]=\mathrm{const}\left[1+O\left(\frac{M}{b}\right)\right]dpydλ=2Mb(x2+b2)3/2pxdxdλ,px=p0[1+O(Mb)]=const[1+O(Mb)]
(c) Integrate this equation for p y p y p^(y)p^{y}py, assuming p y = 0 p y = 0 p^(y)=0p^{y}=0py=0 at x = x = x=-oox=-\inftyx= (photon moving precisely in x x xxx-direction initially); thereby obtain
p y ( x = + ) = 4 M b p x p y ( x = + ) = 4 M b p x p^(y)(x=+oo)=-(4M)/(b)p^(x)p^{y}(x=+\infty)=-\frac{4 M}{b} p^{x}py(x=+)=4Mbpx
(d) Show that this corresponds to deflection of light through the angle
(18.19) Δ ϕ = 4 M / b = 1 .75 ( R / b ) (18.19) Δ ϕ = 4 M / b = 1 .75 R / b {:(18.19)Delta phi=4M//b=1^('').75(R_(o.)//b):}\begin{equation*} \Delta \phi=4 M / b=1^{\prime \prime} .75\left(R_{\odot} / b\right) \tag{18.19} \end{equation*}(18.19)Δϕ=4M/b=1.75(R/b)
where R R R_(o.)R_{\odot}R is the radius of the sun. For a comparison of this prediction with experiment, see Box 40.1.

Exercise 18.7. GRAVITATIONAL REDSHIFT

(a) Use the geodesic equation for a photon, written in the form
d p μ / d λ Γ α μ β p α p β = 0 , d p μ / d λ Γ α μ β p α p β = 0 , dp_(mu)//dlambda^(**)-Gamma^(alpha)_(mu beta)p_(alpha)p^(beta)=0,d p_{\mu} / d \lambda^{*}-\Gamma^{\alpha}{ }_{\mu \beta} p_{\alpha} p^{\beta}=0,dpμ/dλΓαμβpαpβ=0,
to prove that any photon moving freely in the sun's gravitational field [line element (18.16)] has d p 0 / d λ = 0 d p 0 / d λ = 0 dp_(0)//dlambda^(**)=0d p_{0} / d \lambda^{*}=0dp0/dλ=0; i.e.,
(18.20) p 0 = constant along photon's world line. (18.20) p 0 =  constant along photon's world line.  {:(18.20)p_(0)=" constant along photon's world line. ":}\begin{equation*} p_{0}=\text { constant along photon's world line. } \tag{18.20} \end{equation*}(18.20)p0= constant along photon's world line. 
(b) An atom at rest on the sun's surface emits a photon of wavelength λ e λ e lambda_(e)\lambda_{e}λe, as seen in its orthonormal frame. [Note:
(18.21) h v e = h / λ e = ( energy atom measures ) = p u e (18.21) h v e = h / λ e = (  energy atom measures  ) = p u e {:(18.21)hv_(e)=h//lambda_(e)=(" energy atom measures ")=-p*u_(e):}\begin{equation*} h v_{e}=h / \lambda_{e}=(\text { energy atom measures })=-\boldsymbol{p} \cdot \boldsymbol{u}_{e} \tag{18.21} \end{equation*}(18.21)hve=h/λe=( energy atom measures )=pue
where p p p\boldsymbol{p}p is the photon's 4 -momentum and u e u e u_(e)\boldsymbol{u}_{e}ue is the emitter's 4 -velocity.] An atom at rest far from the sun receives the photon, and measures its wavelength to be λ r λ r lambda_(r)\lambda_{r}λr [Note: h / λ r = h / λ r = h//lambda_(r)=h / \lambda_{r}=h/λr= p u r p u r -p*u_(r)-\boldsymbol{p} \cdot \boldsymbol{u}_{r}pur.] Show that the photon is redshifted by the amount
(18.22) z λ r λ e λ e = M R = 2 × 10 6 (18.22) z λ r λ e λ e = M R = 2 × 10 6 {:(18.22)z-=(lambda_(r)-lambda_(e))/(lambda_(e))=(M_(o.))/(R_(o.))=2xx10^(-6):}\begin{equation*} z \equiv \frac{\lambda_{r}-\lambda_{e}}{\lambda_{e}}=\frac{M_{\odot}}{R_{\odot}}=2 \times 10^{-6} \tag{18.22} \end{equation*}(18.22)zλrλeλe=MR=2×106
[Hint: u r = / t ; u e = ( 1 2 M / r ) 1 / 2 / t u r = / t ; u e = ( 1 2 M / r ) 1 / 2 / t u_(r)=del//del t;u_(e)=(1-2M//r)^(-1//2)del//del t\boldsymbol{u}_{r}=\partial / \partial t ; \boldsymbol{u}_{e}=(1-2 M / r)^{-1 / 2} \partial / \partial tur=/t;ue=(12M/r)1/2/t. Why?] For further discussion of the gravitational redshift and experimental results, see § § 7.4 § § 7.4 §§7.4\S \S 7.4§§7.4 and 38.5 ; also Figures 38.1 and 38.2.

синетет 19

MASS AND ANGULAR MOMENTUM OF A GRAVITATING SYSTEM

Metric far from a weakly gravitating system, as a power series in 1 / r 1 / r 1//r1 / r1/r :
(1) derivation

§19.1. EXTERNAL FIELD OF A WEAKLY GRAVITATING SOURCE

Consider an isolated system with gravity so weak that in calculating its structure and motion one can completely ignore self-gravitational effects. (This is true of an asteroid, and of a nebula with high-energy electrons and protons spiraling in a magnetic field; it is not true of the Earth or the sun.) Assume nothing else about the system-for example, by contrast with Newtonian theory, allow velocities to be arbitrarily close to the speed of light, and allow stresses T j k T j k T^(jk)T^{j k}Tjk and momentum densities T 0 j T 0 j T^(0j)T^{0 j}T0j to be comparable to the mass-energy density T 00 T 00 T^(00)T^{00}T00.
Calculate the weak gravitational field,
(19.1) g μ ν = η μ ν + h μ ν , (19.2) h ¯ μ ν h μ ν = 4 T ¯ μ ν ( t | x x | , x ) | x x | d 3 x , (19.1) g μ ν = η μ ν + h μ ν , (19.2) h ¯ ¯ μ ν h μ ν = 4 T ¯ μ ν t x x , x x x d 3 x , {:[(19.1)g_(mu nu)=eta_(mu nu)+h_(mu nu)","],[(19.2) bar(bar(h))_(mu nu)-=h_(mu nu)=int(4 bar(T)_(mu nu)(t-|x-x^(')|,x^(')))/(|x-x^(')|)d^(3)x^(')","]:}\begin{gather*} g_{\mu \nu}=\eta_{\mu \nu}+h_{\mu \nu}, \tag{19.1}\\ \overline{\bar{h}}_{\mu \nu} \equiv h_{\mu \nu}=\int \frac{4 \bar{T}_{\mu \nu}\left(t-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|, \boldsymbol{x}^{\prime}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}, \tag{19.2} \end{gather*}(19.1)gμν=ημν+hμν,(19.2)h¯μνhμν=4T¯μν(t|xx|,x)|xx|d3x,
produced by such a system [see "barred" version of equation (18.14)]. Restrict attention to the spacetime region far outside the system, and expand h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν in powers of x / r x / | x | x / r x / | x | x^(')//r-=x^(')//|x|\boldsymbol{x}^{\prime} / r \equiv \boldsymbol{x}^{\prime} /|\boldsymbol{x}|x/rx/|x|, using the relations
(19.3a) T ¯ μ ν ( t | x x | , x ) = n = 0 1 n ! [ n t n T ¯ μ ν ( t r , x ) ] ( r | x x | ) n , (19.3b) r | x x | = x j ( x j r ) + 1 2 x j x k r ( x j x k r 2 δ j k r 2 ) + , (19.3c) 1 | x x | = 1 r + x j r 2 x j r + 1 2 x j x k r 3 ( 3 x j x k r 2 δ j k ) r 2 + (19.3a) T ¯ μ ν t x x , x = n = 0 1 n ! n t n T ¯ μ ν t r , x r x x n , (19.3b) r x x = x j x j r + 1 2 x j x k r x j x k r 2 δ j k r 2 + , (19.3c) 1 x x = 1 r + x j r 2 x j r + 1 2 x j x k r 3 3 x j x k r 2 δ j k r 2 + {:[(19.3a) bar(T)_(mu nu)(t-|x-x^(')|,x^('))=sum_(n=0)^(oo)(1)/(n!)[(del^(n))/(delt^(n)) bar(T)_(mu nu)(t-r,x^('))](r-|x-x^(')|)^(n)","],[(19.3b)r-|x-x^(')|=x^(j)((x^(j^(')))/(r))+(1)/(2)(x^(j)x^(k))/(r)((x^(j^('))x^(k^('))-r^('2)delta_(jk))/(r^(2)))+cdots","],[(19.3c)(1)/(|x-x^(')|)=(1)/(r)+(x^(j))/(r^(2))(x^(j^(')))/(r)+(1)/(2)(x^(j)x^(k))/(r^(3))((3x^(j^('))x^(k^('))-r^('2)delta_(jk)))/(r^(2))+cdots]:}\begin{gather*} \bar{T}_{\mu \nu}\left(t-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|, \boldsymbol{x}^{\prime}\right)=\sum_{n=0}^{\infty} \frac{1}{n!}\left[\frac{\partial^{n}}{\partial t^{n}} \bar{T}_{\mu \nu}\left(t-r, \boldsymbol{x}^{\prime}\right)\right]\left(r-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|\right)^{n}, \tag{19.3a}\\ r-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|=x^{j}\left(\frac{x^{j^{\prime}}}{r}\right)+\frac{1}{2} \frac{x^{j} x^{k}}{r}\left(\frac{x^{j^{\prime}} x^{k^{\prime}}-r^{\prime 2} \delta_{j k}}{r^{2}}\right)+\cdots, \tag{19.3b}\\ \frac{1}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|}=\frac{1}{r}+\frac{x^{j}}{r^{2}} \frac{x^{j^{\prime}}}{r}+\frac{1}{2} \frac{x^{j} x^{k}}{r^{3}} \frac{\left(3 x^{j^{\prime}} x^{k^{\prime}}-r^{\prime 2} \delta_{j k}\right)}{r^{2}}+\cdots \tag{19.3c} \end{gather*}(19.3a)T¯μν(t|xx|,x)=n=01n![ntnT¯μν(tr,x)](r|xx|)n,(19.3b)r|xx|=xj(xjr)+12xjxkr(xjxkr2δjkr2)+,(19.3c)1|xx|=1r+xjr2xjr+12xjxkr3(3xjxkr2δjk)r2+
Perform the calculation in the system's rest frame, where
(19.4a) P j T 0 j d 3 x = 0 (19.4a) P j T 0 j d 3 x = 0 {:(19.4a)P^(j)-=intT^(0j)d^(3)x=0:}\begin{equation*} P^{j} \equiv \int T^{0 j} d^{3} x=0 \tag{19.4a} \end{equation*}(19.4a)PjT0jd3x=0
with origin of coordinates at the system's center of mass
(19.4b) x j T 00 d 3 x = 0 (19.4b) x j T 00 d 3 x = 0 {:(19.4b)intx^(j)T^(00)d^(3)x=0:}\begin{equation*} \int x^{j} T^{00} d^{3} x=0 \tag{19.4b} \end{equation*}(19.4b)xjT00d3x=0
The result, after a change of gauge to simplify h 00 h 00 h_(00)h_{00}h00 and h 0 j h 0 j h_(0j)h_{0 j}h0j, is
d s 2 = [ 1 2 M r + 0 ( 1 r 3 ) ] d t 2 [ 4 ϵ j k k S k x r 3 + 0 ( 1 r 3 ) ] d t d x j (19.5) + [ ( 1 + 2 M r ) δ j k + ( gravitational radiation terms that die out as 0 ( 1 / r ) ) ] d x j d x k d s 2 = 1 2 M r + 0 1 r 3 d t 2 4 ϵ j k k S k x r 3 + 0 1 r 3 d t d x j (19.5) + 1 + 2 M r δ j k + (  gravitational radiation terms   that die out as  0 ( 1 / r ) ) d x j d x k {:[ds^(2)=-[1-(2M)/(r)+0((1)/(r^(3)))]dt^(2)-[4epsilon_(jkk)S^(k)(x^(ℓ))/(r^(3))+0((1)/(r^(3)))]dtdx^(j)],[(19.5)+[(1+(2M)/(r))delta_(jk)+((" gravitational radiation terms ")/(" that die out as "0(1//r)))]dx^(j)dx^(k)]:}\begin{align*} d s^{2}= & -\left[1-\frac{2 M}{r}+0\left(\frac{1}{r^{3}}\right)\right] d t^{2}-\left[4 \epsilon_{j k k} S^{k} \frac{x^{\ell}}{r^{3}}+0\left(\frac{1}{r^{3}}\right)\right] d t d x^{j} \\ & +\left[\left(1+\frac{2 M}{r}\right) \delta_{j k}+\binom{\text { gravitational radiation terms }}{\text { that die out as } 0(1 / r)}\right] d x^{j} d x^{k} \tag{19.5} \end{align*}ds2=[12Mr+0(1r3)]dt2[4ϵjkkSkxr3+0(1r3)]dtdxj(19.5)+[(1+2Mr)δjk+( gravitational radiation terms  that die out as 0(1/r))]dxjdxk
(see exercise 19.1 for derivation.) Here M M MMM and S k S k S^(k)S^{k}Sk are the body's mass and intrinsic angular momentum.
(19.6a) M = T 00 d 3 x (19.6b) S k = ϵ k ι m x l T m 0 d 3 x (19.6a) M = T 00 d 3 x (19.6b) S k = ϵ k ι m x l T m 0 d 3 x {:[(19.6a)M=intT^(00)d^(3)x],[(19.6b)S_(k)=intepsilon_(k iota m)x^(l)T^(m0)d^(3)x]:}\begin{align*} & M=\int T^{00} d^{3} x \tag{19.6a}\\ & S_{k}=\int \epsilon_{k \iota m} x^{l} T^{m 0} d^{3} x \tag{19.6b} \end{align*}(19.6a)M=T00d3x(19.6b)Sk=ϵkιmxlTm0d3x
The corresponding Newtonian potential is
(19.6c) Φ = 1 2 ( g 00 η 00 ) = M r + 0 ( 1 r 3 ) . (19.6c) Φ = 1 2 g 00 η 00 = M r + 0 1 r 3 . {:(19.6c)Phi=-(1)/(2)(g_(00)-eta_(00))=-(M)/(r)+0((1)/(r^(3))).:}\begin{equation*} \Phi=-\frac{1}{2}\left(g_{00}-\eta_{00}\right)=-\frac{M}{r}+0\left(\frac{1}{r^{3}}\right) . \tag{19.6c} \end{equation*}(19.6c)Φ=12(g00η00)=Mr+0(1r3).
Conclusion: With an appropriate choice of gauge, Φ Φ Phi\PhiΦ and g 00 g 00 g_(00)g_{00}g00 far from any weak source are time-independent and are determined uniquely by the source's mass M M MMM; g 0 j g 0 j g_(0j)g_{0 j}g0j is time-independent and is fixed by the source's intrinsic angular momentum S j S j S^(j)S^{j}Sj; but g j k g j k g_(jk)g_{j k}gjk has time-dependent terms (gravitational waves!) of 0 ( 1 / r ) 0 ( 1 / r ) 0(1//r)0(1 / r)0(1/r).
The rest of this chapter focuses on the "imprints" of the mass and angular momentum in the gravitational field; the gravitational waves will be ignored, or almost so, until Chapter 35.
How metric depends on system's mass M M MMM and angular momentum S S S\boldsymbol{S}S

Exercise 19.1. DERIVATION OF METRIC FAR OUTSIDE A WEAKLY GRAVITATING BODY

(a) Derive equation (19.5). [Hints: (1) Follow the procedure outlined in the text. (2) When calculating h 00 h 00 h_(00)h_{00}h00, write out explicitly the n = 0 n = 0 n=0n=0n=0 and n = 1 n = 1 n=1n=1n=1 terms of (19.2), to precision 0 ( 1 / r 2 ) 0 1 / r 2 0(1//r^(2))0\left(1 / r^{2}\right)0(1/r2), and simplify the n = 0 n = 0 n=0n=0n=0 term using the identities
(19.7a) T j k = 1 2 ( T 00 x j x k ) , 00 + ( T i j x k + T k x j ) , t 1 2 ( T i m x j x k ) , t m (19.7b) T x m = ( T 0 i x x m 1 2 T 0 m r 2 ) , 0 + ( T i k x k x m 1 2 T t m r 2 ) , i (19.7a) T j k = 1 2 T 00 x j x k , 00 + T i j x k + T k x j , t 1 2 T i m x j x k , t m (19.7b) T x m = T 0 i x x m 1 2 T 0 m r 2 , 0 + T i k x k x m 1 2 T t m r 2 , i {:[(19.7a)T^(jk)=(1)/(2)(T^(00)x^(j)x^(k))_(,00)+(T^(ij)x^(k)+T^(ℓk)x^(j))_(,t)-(1)/(2)(T^(im)x^(j)x^(k))_(,tm)],[(19.7b)T^(ℓℓ)x^(m)=(T^(0i)x^(ℓ)x^(m)-(1)/(2)T^(0m)r^(2))_(,0)+(T^(ik)x^(k)x^(m)-(1)/(2)T^(tm)r^(2))_(,i)]:}\begin{align*} & T^{j k}=\frac{1}{2}\left(T^{00} x^{j} x^{k}\right)_{, 00}+\left(T^{i j} x^{k}+T^{\ell k} x^{j}\right)_{, t}-\frac{1}{2}\left(T^{i m} x^{j} x^{k}\right)_{, t m} \tag{19.7a}\\ & T^{\ell \ell} x^{m}=\left(T^{0 i} x^{\ell} x^{m}-\frac{1}{2} T^{0 m} r^{2}\right)_{, 0}+\left(T^{i k} x^{k} x^{m}-\frac{1}{2} T^{t m} r^{2}\right)_{, i} \tag{19.7b} \end{align*}(19.7a)Tjk=12(T00xjxk),00+(Tijxk+Tkxj),t12(Timxjxk),tm(19.7b)Txm=(T0ixxm12T0mr2),0+(Tikxkxm12Ttmr2),i
(Verify that these identities follow from T α β , β = 0 T α β , β = 0 T^(alpha beta)_(,beta)=0T^{\alpha \beta}{ }_{, \beta}=0Tαβ,β=0.) (3) When calculating h 0 m h 0 m h_(0m)h_{0 m}h0m, write out explicitly the n = 0 n = 0 n=0n=0n=0 term of (19.2), to precision 0 ( 1 / r 2 ) 0 1 / r 2 0(1//r^(2))0\left(1 / r^{2}\right)0(1/r2), and simplify it using the identity
(19.7c) T 0 k x j + T 0 j x k = ( T 00 x j x k ) , 0 + ( T 01 x j x k ) , l . (19.7c) T 0 k x j + T 0 j x k = T 00 x j x k , 0 + T 01 x j x k , l . {:(19.7c)T^(0k)x^(j)+T^(0j)x^(k)=(T^(00)x^(j)x^(k))_(,0)+(T^(01)x^(j)x^(k))_(,l).:}\begin{equation*} T^{0 k} x^{j}+T^{0 j} x^{k}=\left(T^{00} x^{j} x^{k}\right)_{, 0}+\left(T^{01} x^{j} x^{k}\right)_{, l} . \tag{19.7c} \end{equation*}(19.7c)T0kxj+T0jxk=(T00xjxk),0+(T01xjxk),l.
(Verify that this follows from T α β , β = 0 T α β , β = 0 T^(alpha beta)_(,beta)=0T^{\alpha \beta}{ }_{, \beta}=0Tαβ,β=0.) (4) Simplify h 00 h 00 h_(00)h_{00}h00 and h 0 m h 0 m h_(0m)h_{0 m}h0m by the gauge transformation generated by
ξ 0 = 1 2 r t T 00 r r 2 d 3 x + x j r 3 ( T 0 k x k x j 1 2 T 0 j r 2 ) d 3 x + ( T 00 + T 11 ) [ x j x j r 2 + ( 3 x j x k r 2 δ j k ) x i x k 2 r 4 ] d 3 x + n = 2 1 n ! n 1 t n 1 ( T 00 + T k k ) ( r | x x | ) n | x x | d 3 x , ξ m = 2 x j r 3 T 00 x j x m d 3 x + 4 n = 1 1 n ! n 1 t n 1 T 0 m ( r | x x | ) n | x x | d 3 x + x m r ξ 0 1 2 ( 1 r ) . m T 00 r 2 d 3 x ( x k r 2 ) , m ( T 0 j x x k 1 2 T 0 k r 2 ) d 3 x n = 2 x 1 n ! n 2 t n 2 ( T 00 + T k k ) [ ( r x x ) n | x x | ] , m d 3 x ξ 0 = 1 2 r t T 00 r r 2 d 3 x + x j r 3 T 0 k x k x j 1 2 T 0 j r 2 d 3 x + T 00 + T 11 x j x j r 2 + 3 x j x k r 2 δ j k x i x k 2 r 4 d 3 x + n = 2 1 n ! n 1 t n 1 T 00 + T k k r x x n x x d 3 x , ξ m = 2 x j r 3 T 00 x j x m d 3 x + 4 n = 1 1 n ! n 1 t n 1 T 0 m r x x n x x d 3 x + x m r ξ 0 1 2 1 r . m T 00 r 2 d 3 x x k r 2 , m T 0 j x x k 1 2 T 0 k r 2 d 3 x n = 2 x 1 n ! n 2 t n 2 T 00 + T k k r x x n x x , m d 3 x {:[xi_(0)=(1)/(2r)(del)/(del t)intT^(00r^('))r^(2)d^(3)x^(')+(x^(j))/(r^(3))int(T^(0k^('))x^(k^('))x^(j^('))-(1)/(2)T^(0j^(')r^('2)))d^(3)x^(')],[+int(T_(00)^(')+T_(11)^('))[(x^(j)x^(j^(')))/(r^(2))+((3x^(j^('))x^(k^('))-r^('2)delta_(jk))x^(i)x^(k))/(2r^(4))]d^(3)x^(')],[+sum_(n=2)^(oo)(1)/(n!)(del^(n-1))/(delt^(n-1))int(T_(00)^(')+T_(kk))((r-|x-x^(')|)^(n))/(|x-x^(')|)d^(3)x^(')","],[xi_(m)=-(2x^(j))/(r^(3))intT_(00)^(')x^(j^('))x^(m^('))d^(3)x^(')+4sum_(n=1)^(oo)(1)/(n!)(del^(n-1))/(delt^(n-1))intT_(0m)((r-|x-x^(')|)^(n))/(|x-x^(')|)d^(3)x^(')],[+(x^(m))/(r)xi_(0)-(1)/(2)((1)/(r))_(.m)intT_(00)^(')r^('2)d^(3)x^(')-((x^(k))/(r^(2)))_(,m)int(T^(0j^('))x^(')x^(k^('))-(1)/(2)T^(0k^(')r^('2)))d^(3)x^(')],[-sum_(n=2)^(x)(1)/(n!)(del^(n-2))/(delt^(n-2))int(T_(00)^(')+T_(kk^(')))[((r-∣x-x^('))^(n))/(|x-x^(')|)]_(,m)d^(3)x^(')]:}\begin{aligned} & \xi_{0}=\frac{1}{2 r} \frac{\partial}{\partial t} \int T^{00 r^{\prime}} r^{2} d^{3} x^{\prime}+\frac{x^{j}}{r^{3}} \int\left(T^{0 k^{\prime}} x^{k^{\prime}} x^{j^{\prime}}-\frac{1}{2} T^{0 j^{\prime} r^{\prime 2}}\right) d^{3} x^{\prime} \\ & +\int\left(T_{00}{ }^{\prime}+T_{11}{ }^{\prime}\right)\left[\frac{x^{j} x^{j^{\prime}}}{r^{2}}+\frac{\left(3 x^{j^{\prime}} x^{k^{\prime}}-r^{\prime 2} \delta_{j k}\right) x^{i} x^{k}}{2 r^{4}}\right] d^{3} x^{\prime} \\ & +\sum_{n=2}^{\infty} \frac{1}{n!} \frac{\partial^{n-1}}{\partial t^{n-1}} \int\left(T_{00}{ }^{\prime}+T_{k k}\right) \frac{\left(r-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|\right)^{n}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}, \\ & \xi_{m}=-\frac{2 x^{j}}{r^{3}} \int T_{00}{ }^{\prime} x^{j^{\prime}} x^{m^{\prime}} d^{3} x^{\prime}+4 \sum_{n=1}^{\infty} \frac{1}{n!} \frac{\partial^{n-1}}{\partial t^{n-1}} \int T_{0 m} \frac{\left(r-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|\right)^{n}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} \\ & +\frac{x^{m}}{r} \xi_{0}-\frac{1}{2}\left(\frac{1}{r}\right)_{. m} \int T_{00}{ }^{\prime} r^{\prime 2} d^{3} x^{\prime}-\left(\frac{x^{k}}{r^{2}}\right)_{, m} \int\left(T^{0 j^{\prime}} x^{\prime} x^{k^{\prime}}-\frac{1}{2} T^{0 k^{\prime} r^{\prime 2}}\right) d^{3} x^{\prime} \\ & -\sum_{n=2}^{x} \frac{1}{n!} \frac{\partial^{n-2}}{\partial t^{n-2}} \int\left(T_{00}{ }^{\prime}+T_{k k^{\prime}}\right)\left[\frac{\left(r-\mid \boldsymbol{x}-\boldsymbol{x}^{\prime}\right)^{n}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|}\right]_{, m} d^{3} x^{\prime} \end{aligned}ξ0=12rtT00rr2d3x+xjr3(T0kxkxj12T0jr2)d3x+(T00+T11)[xjxjr2+(3xjxkr2δjk)xixk2r4]d3x+n=21n!n1tn1(T00+Tkk)(r|xx|)n|xx|d3x,ξm=2xjr3T00xjxmd3x+4n=11n!n1tn1T0m(r|xx|)n|xx|d3x+xmrξ012(1r).mT00r2d3x(xkr2),m(T0jxxk12T0kr2)d3xn=2x1n!n2tn2(T00+Tkk)[(rxx)n|xx|],md3x
Here T μ ν T μ ν T_(mu nu)^(')T_{\mu \nu}{ }^{\prime}Tμν denotes T μ ν ( t r , x ) T μ ν t r , x T_(mu nu)(t-r,x^('))T_{\mu \nu}\left(t-r, \boldsymbol{x}^{\prime}\right)Tμν(tr,x).]
(b) Prove that the system's mass and angular momentum are conserved. [Note: Because T α β β β = 0 T α β β β = 0 T^(alpha beta)_(beta beta)=0T^{\alpha \beta}{ }_{\beta \beta}=0Tαβββ=0 (self-gravity has negligible influence), the proof is no different here than in flat spacetime (Chapter 5).]
For a weakly gravitating system:
(1) total mass M M MMM can be measured by applying Kepler's "1-2-3" law to orbiting particles

§19.2. MEASUREMENT OF THE MASS AND ANGULAR MOMENTUM

The values of a system's mass and angular momentum can be measured by probing the imprint they leave in its external gravitational field. Of all tools one might use to probe, the simplest is a test particle in a gravitationally bound orbit. If the particle is sufficiently far from the source, its motion is affected hardly at all by the source's angular momentum or by the gravitational waves; only the spherical, Newtonian part of the gravitational field has a significant influence. Hence, the particle moves in an elliptical Keplerian orbit. To determine the source's mass M M MMM, one need only apply Kepler's third law (perhaps better called "Kepler's 1-2-3 law"):
(19.8) M = ( 2 π orbital period ) 2 ( Semi-major axis of ellipse ) 3 ; i.e., M 1 = ω 2 a 3 . (19.8) M = 2 π  orbital period  2 (  Semi-major axis   of ellipse  ) 3 ;  i.e.,  M 1 = ω 2 a 3 . {:(19.8)M=((2pi)/(" orbital period "))^(2)((" Semi-major axis ")/(" of ellipse "))^(3);quad" i.e., "M^(1)=omega^(2)a^(3).:}\begin{equation*} M=\left(\frac{2 \pi}{\text { orbital period }}\right)^{2}\binom{\text { Semi-major axis }}{\text { of ellipse }}^{3} ; \quad \text { i.e., } M^{1}=\omega^{2} a^{3} . \tag{19.8} \end{equation*}(19.8)M=(2π orbital period )2( Semi-major axis  of ellipse )3; i.e., M1=ω2a3.
The source's angular momentum is not measured quite so easily. One must use a probe that is insensitive to Newtonian gravitational effects, but "feels" the offdiagonal term,
(19.9) g 0 j = 2 ϵ j k t S k x t / r 3 (19.9) g 0 j = 2 ϵ j k t S k x t / r 3 {:(19.9)g_(0j)=-2epsilon^(jkt)S^(k)x^(t)//r^(3):}\begin{equation*} g_{0 j}=-2 \epsilon^{j k t} S^{k} x^{t} / r^{3} \tag{19.9} \end{equation*}(19.9)g0j=2ϵjktSkxt/r3
in the metric (19.5). One such probe is the precession of the perihelion of a corevolving satellite, relative to the precession for a counterrevolving satellite. A gyroscope is another such probe. Place a gyroscope at rest in the source's gravitational field. By a force applied to its center of mass, prevent it from falling. As time passes, the g 0 j g 0 j g_(0j)g_{0 j}g0j term in the metric will force the gyroscope to precess relative to the basis vectors / x j / x j del//delx^(j)\partial / \partial x^{j}/xj; and since these basis vectors are "tied" to the coordinate system, which in turn is tied to the Lorentz frames at infinity, which in turn are tied to the "fixed stars" (cf. §39.12), the precession is relative to the "fixed stars." The angular velocity of precession, as derived in exercise 19.2, is
(19.10) Ω = 1 r 3 [ S + 3 ( S x ) x r 2 ] (19.10) Ω = 1 r 3 S + 3 ( S x ) x r 2 {:(19.10)Omega=(1)/(r^(3))[-S+(3(S*x)x)/(r^(2))]:}\begin{equation*} \boldsymbol{\Omega}=\frac{1}{r^{3}}\left[-\boldsymbol{S}+\frac{3(\boldsymbol{S} \cdot \boldsymbol{x}) \boldsymbol{x}}{r^{2}}\right] \tag{19.10} \end{equation*}(19.10)Ω=1r3[S+3(Sx)xr2]
One sometimes says that the source's rotation "drags the inertial frames near the source," thereby forcing the gyroscope to precess. For further discussion, see § § 21.12 § § 21.12 §§21.12\S \S 21.12§§21.12, 40.7, and 33.4.

Exercise 19.2. GYROSCOPE PRECESSION

Derive equation (19.10) for the angular velocity of gyroscope precession. [Hints: Place an orthonormal tetrad at the gyroscope's center of mass. Tie the tetrad rigidly to the coordinate system, and hence to the "fixed stars"; more particularly, choose the tetrad to be that basis { e α ^ } e α ^ {e_( hat(alpha))}\left\{e_{\hat{\alpha}}\right\}{eα^} which is dual to the following 1-form basis:
(19.11) ω t ^ = [ 1 ( 2 M / r ) ] 1 / 2 d t + 2 ϵ j k S k ( x t / r 3 ) d x j , ω j = [ 1 + ( 2 M / r ) ] 1 / 2 d x j . (19.11) ω t ^ = [ 1 ( 2 M / r ) ] 1 / 2 d t + 2 ϵ j k S k x t / r 3 d x j , ω j = [ 1 + ( 2 M / r ) ] 1 / 2 d x j . {:(19.11)omega^( hat(t))=[1-(2M//r)]^(1//2)dt+2epsilon_(jkℓ)S^(k)(x^(t)//r^(3))dx^(j)","quadomega^(j)=[1+(2M//r)]^(1//2)dx^(j).:}\begin{equation*} \boldsymbol{\omega}^{\hat{t}}=[1-(2 M / r)]^{1 / 2} \boldsymbol{d} t+2 \epsilon_{j k \ell} S^{k}\left(x^{t} / r^{3}\right) \boldsymbol{d} x^{j}, \quad \boldsymbol{\omega}^{j}=[1+(2 M / r)]^{1 / 2} \boldsymbol{d} x^{j} . \tag{19.11} \end{equation*}(19.11)ωt^=[1(2M/r)]1/2dt+2ϵjkSk(xt/r3)dxj,ωj=[1+(2M/r)]1/2dxj.
The spatial legs of the tetrad, e j ^ e j ^ e_( hat(j))\boldsymbol{e}_{\hat{j}}ej^, rotate relative to the gyroscope with an angular velocity ω ω omega\omegaω given by [see equation (13.69)]
ϵ i j k ^ k ω k ^ = Γ i ^ j ^ 0 ^ . ϵ i j k ^ k ω k ^ = Γ i ^ j ^ 0 ^ -epsilon_( hat(ijk)k)omega^( hat(k))=Gamma_( hat(i) hat(j) hat(0))". "-\epsilon_{\hat{i j k} k} \omega^{\hat{k}}=\Gamma_{\hat{i} \hat{j} \hat{0}} \text {. }ϵijk^kωk^=Γi^j^0^
Consequently, the gyroscope's angular momentum vector L L L\boldsymbol{L}L precesses relative to the tetrad with angular velocity Ω = ω Ω = ω Omega=-omega\Omega=-\omegaΩ=ω :
(19.12) d L j ^ d t ^ = ϵ j k ^ ^ Ω k ^ L , ϵ i j k ^ Ω k ^ = Γ i j 0 ^ . (19.12) d L j ^ d t ^ = ϵ j k ^ ^ Ω k ^ L , ϵ i j k ^ Ω k ^ = Γ i j 0 ^ . {:(19.12)(dL^( hat(j)))/(d( hat(t)))=epsilon_( hat(j( hat(k)))darr)Omega^( hat(k))L^(ℓ)","quadepsilon_(ij hat(k))Omega^( hat(k))=Gamma_(ij hat(0)).:}\begin{equation*} \frac{d L^{\hat{j}}}{d \hat{t}}=\epsilon_{\hat{j \hat{k}} \downarrow} \Omega^{\hat{k}} L^{\ell}, \quad \epsilon_{i j \hat{k}} \Omega^{\hat{k}}=\Gamma_{i j \hat{0}} . \tag{19.12} \end{equation*}(19.12)dLj^dt^=ϵjk^^Ωk^L,ϵijk^Ωk^=Γij0^.
Calculate Γ i j ^ 0 ^ Γ i j ^ 0 ^ Gamma_( hat(ij) hat(0))\Gamma_{\hat{i j} \hat{0}}Γij^0^ for the given orthonormal frame, and thereby obtain equation (19.10) for Ω Ω Omega\boldsymbol{\Omega}Ω.]
(2) total angular momentum S S S\boldsymbol{S}S can be measured by examining the precession of gyroscopes

EXERCISE

§19.3. MASS AND ANGULAR MOMENTUM OF FULLY relativistic sources

Abandon, now, the restriction to weakly gravitating sources. Consider an isolated, gravitating system inside which spacetime may or may not be highly curved-a black hole, a neutron star, the Sun, . . . But refuse, for now, to analyze the system's interior or the "strong-field region" near the system. Instead, restrict attention to the weak
gravitational field far from the source, and analyze it using linearized theory in vacuum. Expand h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν in multipole moments and powers of 1 / r 1 / r 1//r1 / r1/r; and adjust the gauge, the Lorentz frame, and the origin of coordinates to simplify the resulting metric. The outcome of such a calculation is a gravitational field identical to that for a weak source [equation (19.5)]! (Details of the calculation are not spelled out here because of their length; but see exercise 19.3.)
But before accepting this as the distant field of an arbitrary source, one should examine the nonlinear effects in the vacuum field equations. Two types of nonlinearities turn out to be important far from the source: (1) nonlinearities in the static, Newtonian part of the metric, which generate metric corrections
δ g 00 = 2 M 2 / r 2 , δ g j k = 3 2 ( M 2 / r 2 ) δ j k , δ g 00 = 2 M 2 / r 2 , δ g j k = 3 2 M 2 / r 2 δ j k , deltag_(00)=-2M^(2)//r^(2),quad deltag_(jk)=(3)/(2)(M^(2)//r^(2))delta_(jk),\delta g_{00}=-2 M^{2} / r^{2}, \quad \delta g_{j k}=\frac{3}{2}\left(M^{2} / r^{2}\right) \delta_{j k},δg00=2M2/r2,δgjk=32(M2/r2)δjk,
(see exercise 19.3 and § 39.8 § 39.8 §39.8\S 39.8§39.8 ), thereby putting the metric into the form
d s 2 = [ 1 2 M r + 2 M 2 r 2 + 0 ( 1 r 3 ) ] d t 2 [ 4 ϵ j k l S k x l r 3 + 0 ( 1 r 3 ) ] d t d x j + [ ( 1 + 2 M r + 3 M 2 2 r 2 ) δ j k + ( gravitational radiation terms that die out as 0 ( 1 / r ) ) ] d x j d x k d s 2 = 1 2 M r + 2 M 2 r 2 + 0 1 r 3 d t 2 4 ϵ j k l S k x l r 3 + 0 1 r 3 d t d x j + 1 + 2 M r + 3 M 2 2 r 2 δ j k + (  gravitational radiation terms   that die out as  0 ( 1 / r ) ) d x j d x k {:[ds^(2)=-[1-(2M)/(r)+(2M^(2))/(r^(2))+0((1)/(r^(3)))]dt^(2)-[4epsilon_(jkl)S^(k)(x^(l))/(r^(3))+0((1)/(r^(3)))]dtdx^(j)],[+[(1+(2M)/(r)+(3M^(2))/(2r^(2)))delta_(jk)+((" gravitational radiation terms ")/(" that die out as "0(1//r)))]dx^(j)dx^(k)]:}\begin{aligned} d s^{2}= & -\left[1-\frac{2 M}{r}+\frac{2 M^{2}}{r^{2}}+0\left(\frac{1}{r^{3}}\right)\right] d t^{2}-\left[4 \epsilon_{j k l} S^{k} \frac{x^{l}}{r^{3}}+0\left(\frac{1}{r^{3}}\right)\right] d t d x^{j} \\ & +\left[\left(1+\frac{2 M}{r}+\frac{3 M^{2}}{2 r^{2}}\right) \delta_{j k}+\binom{\text { gravitational radiation terms }}{\text { that die out as } 0(1 / r)}\right] d x^{j} d x^{k} \end{aligned}ds2=[12Mr+2M2r2+0(1r3)]dt2[4ϵjklSkxlr3+0(1r3)]dtdxj+[(1+2Mr+3M22r2)δjk+( gravitational radiation terms  that die out as 0(1/r))]dxjdxk
(2) a gradual decrease in the source's mass, gradual changes in its angular momentum, and gradual changes in its "rest frame" to compensate for the mass, angular momentum, and linear momentum carried off by gravitational waves (see Box 19.1, which is best read only after finishing this section).
By measuring the distant spacetime geometry (19.13) of a given source, one cannot discover whether that source has strong internal gravity, or weak. But when one expresses the constants M M MMM and S j S j S_(j)S_{j}Sj, which determine g 00 g 00 g_(00)g_{00}g00 and g 0 j g 0 j g_(0j)g_{0 j}g0j, as integrals over the interior of the source, one discovers a crucial difference: if the internal gravity is weak, then linearized theory is valid throughout the source, and
(19.14) M = T 00 d 3 x , S j = ϵ j k x k T i 0 d 3 x (19.14) M = T 00 d 3 x , S j = ϵ j k x k T i 0 d 3 x {:(19.14)M=intT_(00)d^(3)x","quadS_(j)=intepsilon_(jkℓ)x^(k)T^(i0)d^(3)x:}\begin{equation*} M=\int T_{00} d^{3} x, \quad S_{j}=\int \epsilon_{j k \ell} x^{k} T^{i 0} d^{3} x \tag{19.14} \end{equation*}(19.14)M=T00d3x,Sj=ϵjkxkTi0d3x
but if the gravity is strong, these formulas fail. Does this failure prevent one, for strong gravity, from identifying the constants M M MMM and S j S j S_(j)S_{j}Sj of the metric (19.13) as the source's mass and angular momentum? Not at all, according to the following argument.
Consider, first, the mass of the sun. For the sun one expects Newtonian theory to be highly accurate (fractional errors M / R 10 6 M / R 10 6 ∼M_(o.)//R_(o.)∼10^(-6)\sim M_{\odot} / R_{\odot} \sim 10^{-6}M/R106 ); so one can assert that the constant M M MMM appearing in the line element (19.13) is, indeed
M = ρ d 3 x = T 00 d 3 x = total mass M = ρ d 3 x = T 00 d 3 x =  total mass  M=int rhod^(3)x=intT_(00)d^(3)x=" total mass "M=\int \rho d^{3} x=\int T_{00} d^{3} x=\text { total mass }M=ρd3x=T00d3x= total mass 
But might this assertion be wrong? To gain greater confidence and insight, adopt the viewpoint of "controlled ignorance"; i.e., do not pretend to know more than what is needed. (This style of physical argument goes back to Newton's famous "Hypotheses non fingo," i.e. "I do not feign hypotheses.") In evaluating the volume integral of T 00 T 00 T_(00)T_{00}T00 (usual Newtonian definition of M M MMM ), one needs a theory of the internal structure
of the sun. For example, one must know that the visible surface layers of the sun do not hide a massive central core, so dense and large that relativistic gravitational fields | Φ | 1 | Φ | 1 |Phi|∼1|\Phi| \sim 1|Φ|1 exist there. If one makes use in the analysis of a fluid-type stress-energy tensor T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν, one needs to know equations of state, opacities, and theories of energy generation and transport. One needs to justify the fluid description as an adequate approximation to the atomic constitution of matter. One needs to assume that an ultimate theory of matter explaining the rest masses of protons and electrons will not assign an important fraction of this mass to strong (nonlinear) gravitational fields on a submicroscopic scale. It is plausible that one could do all this, but it is also obvious that this is not the way the mass of the sun is, in fact, determined by astronomers! Theories of stellar structure are adjusted to give the observed mass; they are not constructed to let one deduce the mass from nongravitational observations. The mass of the sun is measured in practice by studying the orbits of planets in its external gravitational field, a procedure equivalent to reading the mass M M MMM off the line element (19.13), rather than evaluating the volume integral T 00 d 3 x T 00 d 3 x intT^(00)d^(3)x\int T^{00} d^{3} xT00d3x.
To avoid all the above uncertainties, and to make theory correspond as closely as possible to experiment, one defines the "total mass-energy" M M MMM of the sun or any other body to be the constant that appears in the line element (19.13) for its distant external spacetime geometry. Similarly, one defines the body's intrinsic angular momentum as the constant 3-vector S S S\boldsymbol{S}S appearing in its line element (19.13). Operationally, the total mass-energy M M MMM is measured via Kepler's third law; the angular momentum S S S\boldsymbol{S}S is measured via its influence on the precession of a gyroscope or a planetary orbit. This is as true when the body is a black hole or a neutron star as when it is the sun.
What kind of a geometric object is the intrinsic angular momentum S S S\boldsymbol{S}S ? It is defined by measurements made far from the source, where, with receding distance, spacetime is becoming flatter and flatter (asymptotically flat). Thus, it can be regarded as a 3-vector in the "asymptotically flat spacetime" that surrounds the source. But in what Lorentz frame is S S S\boldsymbol{S}S a 3-vector? Clearly, in the asymptotic Lorentz frame where the line element (19.13) is valid; i.e., in the asymptotic Lorentz frame where the source's distant "coulomb" ("M/r") field is static; i.e., in the "asymptotic rest frame" of the source. Alternatively, one can regard S S S\boldsymbol{S}S as a 4 -vector, S S S\boldsymbol{S}S, which is purely spatial ( S 0 = 0 S 0 = 0 S^(0)=0S^{0}=0S0=0 ) in the asymptotic rest frame. If one denotes the 4 -velocity of the asymptotic rest frame by U U U\boldsymbol{U}U, then the fact that S S S\boldsymbol{S}S is purely spatial can be restated geometrically as S U = 0 S U = 0 S*U=0\boldsymbol{S} \cdot \boldsymbol{U}=0SU=0, or
(19.15) s P = 0 (19.15) s P = 0 {:(19.15)s*P=0:}\begin{equation*} \boldsymbol{s} \cdot \boldsymbol{P}=0 \tag{19.15} \end{equation*}(19.15)sP=0
where
(19.16) P M U "total 4-momentum of source" (19.16) P M U  "total 4-momentum of source"  {:(19.16)P-=MU-=" "total 4-momentum of source" ":}\begin{equation*} \boldsymbol{P} \equiv M \boldsymbol{U} \equiv \text { "total 4-momentum of source" } \tag{19.16} \end{equation*}(19.16)PMU "total 4-momentum of source" 
is still another vector residing in the asymptotically flat region of spacetime.
The total 4-momentum P P P\boldsymbol{P}P and intrinsic angular momentum S S S\boldsymbol{S}S satisfy conservation laws that are summarized in Box 19.1. These conservation laws are valuable tools in gravitation theory and relativistic astrophysics, but the derivation of these laws (Chapter 20) does not compare in priority to topics such as neutron stars and basic cosmology; so most readers will wish to skip it on a first reading of this book.
Definition of "total mass-energy" M M MMM and "angular momentum" S S S\boldsymbol{S}S in terms of external gravitational field
S S S\boldsymbol{S}S as a geometric object in an asymptotically flat region far outside source
"Asymptotic rest frame" and "total 4-momentum"
Conservation laws for total 4-momentum and angular momentum

Box 19.1 TOTAL MASS-ENERGY, 4-MOMENTUM, AND ANGULAR MOMENTUM OF AN ISOLATED SYSTEM

A. Spacetime is divided into (1) the source's interior; which is surrounded by (2) a strong-field vacuum region; which in turn is surrounded by (3) a weak-field, asymptotically flat, near-zone region; which in turn is surrounded by (4) a weakfield, asymptotically flat, radiation-zone region. This box and this chapter treat only the asymptotically flat regions. The interior and strong-field regions are treated in the next chapter.

B. The asymptotic rest frame of the source is that global, asymptotically Lorentz frame (coordinates t , x , y , z ) t , x , y , z ) t,x,y,z)t, x, y, z)t,x,y,z) in which the distant, "coulomb" part of the source's field is at rest (see diagram). The asymptotic rest frame does not extend into the strong-field region; any such extension of it would necessarily be forced by the curvature into a highly non-Lorentz, curvilinear form. The spatial origin of the asymptotic rest frame is so adjusted that the source is centered on it-i.e., that the distant Newtonian potential is Φ = M / ( x 2 + y 2 + Φ = M / x 2 + y 2 + Phi=-M//(x^(2)+y^(2)+:}\Phi=-M /\left(x^{2}+y^{2}+\right.Φ=M/(x2+y2+ z 2 ) 1 / 2 + 0 ( 1 / r 3 ) z 2 1 / 2 + 0 1 / r 3 z^(2))^(1//2)+0(1//r^(3))\left.z^{2}\right)^{1 / 2}+0\left(1 / r^{3}\right)z2)1/2+0(1/r3); i.e., that Φ Φ Phi\PhiΦ has no dipole term, D x / r 3 D x / r 3 D*x//r^(3)\boldsymbol{D} \cdot \boldsymbol{x} / r^{3}Dx/r3, such as would originate from an offset of the coordinates.
C. To the source one can attribute a total mass-energy M M MMM, a 4-velocity U U U\boldsymbol{U}U, a total 4 -momentum P P P\boldsymbol{P}P, and an intrinsic angular momentum vector, S S S\boldsymbol{S}S. The 4 -vectors U , P U , P U,P\boldsymbol{U}, \boldsymbol{P}U,P, and S S S\boldsymbol{S}S reside in the asymptotically flat region of spacetime and can be moved about freely there (negligible curvature =>\Rightarrow parallel transport around closed curves does not change U , P U , P U,P\boldsymbol{U}, \boldsymbol{P}U,P, or S ) S ) S)\boldsymbol{S})S). The source's 4 -velocity U U U\boldsymbol{U}U is defined to equal the 4 -velocity of the asymptotic rest frame ( U 0 = 1 , U = 0 U 0 = 1 , U = 0 U^(0)=1,U=0U^{0}=1, \boldsymbol{U}=0U0=1,U=0 in rest frame). The total mass-energy M M MMM is measured via Kepler's third ("1-2-3") law [equation (19.8)]. The total 4-momentum is defined by P M U P M U P-=MU\boldsymbol{P} \equiv M \boldsymbol{U}PMU. The intrinsic angular momentum S S S\boldsymbol{S}S is orthogonal to the 4-velocity U , S U = 0 U , S U = 0 U,S*U=0\boldsymbol{U}, \boldsymbol{S} \cdot \boldsymbol{U}=0U,SU=0 (so S 0 = 0 ; S 0 S 0 = 0 ; S 0 S^(0)=0;S!=0S^{0}=0 ; \boldsymbol{S} \neq 0S0=0;S0 in general in asymptotic rest frame); S S S\boldsymbol{S}S is measured via gyroscope precession or differential perihelion precession (§19.2).
In the asymptotic rest frame, with an appropriate choice of gauge (i.e., of ripples in the coordinates), the slight deviations from flat-spacetime geometry are described by the line element
d s 2 = [ 1 2 M r + 2 M 2 r 2 + 0 ( 1 r 3 ) ] d t 2 [ 4 ϵ j k S k x r 3 + 0 ( 1 r 3 ) ] d t d x j (1) + [ ( 1 + 2 M r + 3 M 2 2 r 2 ) δ j k + ( gravitational radiation terms ) ] d x j d x k d s 2 = 1 2 M r + 2 M 2 r 2 + 0 1 r 3 d t 2 4 ϵ j k S k x r 3 + 0 1 r 3 d t d x j (1) + 1 + 2 M r + 3 M 2 2 r 2 δ j k + (  gravitational radiation terms  ) d x j d x k {:[ds^(2)=-[1-(2M)/(r)+(2M^(2))/(r^(2))+0((1)/(r^(3)))]dt^(2)-[4epsilon_(jkℓ)S^(k)(x^(ℓ))/(r^(3))+0((1)/(r^(3)))]dtdx^(j)],[(1)+[(1+(2M)/(r)+(3M^(2))/(2r^(2)))delta_(jk)+(" gravitational radiation terms ")]dx^(j)dx^(k)]:}\begin{align*} d s^{2}= & -\left[1-\frac{2 M}{r}+\frac{2 M^{2}}{r^{2}}+0\left(\frac{1}{r^{3}}\right)\right] d t^{2}-\left[4 \epsilon_{j k \ell} S^{k} \frac{x^{\ell}}{r^{3}}+0\left(\frac{1}{r^{3}}\right)\right] d t d x^{j} \\ & +\left[\left(1+\frac{2 M}{r}+\frac{3 M^{2}}{2 r^{2}}\right) \delta_{j k}+(\text { gravitational radiation terms })\right] d x^{j} d x^{k} \tag{1} \end{align*}ds2=[12Mr+2M2r2+0(1r3)]dt2[4ϵjkSkxr3+0(1r3)]dtdxj(1)+[(1+2Mr+3M22r2)δjk+( gravitational radiation terms )]dxjdxk
D. Conservation of 4-momentum and angular momentum: Suppose that particles fall into a source or are ejected from it; suppose that electromagnetic waves flow in and out; suppose the source emits gravitational waves. All such processes break the source's isolation and can change its total 4momentum P P P\boldsymbol{P}P, its intrinsic angular momentum S S S\boldsymbol{S}S, and its asymptotic rest frame. Surround the source with a spherical shell S S SSS, which is far enough out to be in the asymptotically flat region. Keep this shell always at rest in the source's momentary asymptotic rest frame. By probing the source's gravitational field near S S S\mathcal{S}S, measure its 4 -momentum P P P\boldsymbol{P}P and intrinsic angular momentum S S S\boldsymbol{S}S as functions of the shell's proper time τ τ tau\tauτ. An analysis given in the next chapter reveals that the 4 -momentum is conserved, in the sense that
Interstellar debris falls into a black hole, and gravitational waves emerge.
(2) d P α d τ = s T α j n j d ( area ) = ( rate at which 4-momentum flows inward through shell ) (2) d P α d τ = s T α j n j d (  area  ) = (  rate at which 4-momentum   flows inward through shell  ) {:(2)(dP^(alpha))/(d tau)=-int_(s)T^(alpha j)n_(j)d(" area ")=((" rate at which 4-momentum ")/(" flows inward through shell ")):}\begin{equation*} \frac{d P^{\alpha}}{d \tau}=-\int_{s} T^{\alpha j} n_{j} d(\text { area })=\binom{\text { rate at which 4-momentum }}{\text { flows inward through shell }} \tag{2} \end{equation*}(2)dPαdτ=sTαjnjd( area )=( rate at which 4-momentum  flows inward through shell )
where n n n\boldsymbol{n}n is the unit outward normal to S S SSS and the integral is performed in the shell's momentary rest frame. In words: the rate at which 4-momentum flows through the shell, as measured in the standard special relativistic manner, equals the rate of change of the source's gravitationally measured 4-momentum. Similarly, the angular momentum is conserved in the sense that
(3a) d S i d τ = S ( ϵ i j k x j T k ) n d ( area ) = ( rate at which angular momentum flows inward through the shell ) , (3b) d S 0 d τ = d U α d τ S α = ( change required to keep S orthogonal to U ; "Fermi-Walker-transport law"; cf. § § 6.5 , 13.6 ) . (3a) d S i d τ = S ϵ i j k x j T k n d (  area  ) =  rate at which angular   momentum flows inward   through the shell  , (3b) d S 0 d τ = d U α d τ S α = (  change required to keep  S  orthogonal to  U ;  "Fermi-Walker-transport law"; cf.  § § 6.5 , 13.6 ) . {:[(3a)(dS_(i))/(d tau)=-int_(S)(epsilon_(ijk)x^(j)T^(kℓ))n_(ℓ)d(" area ")=([" rate at which angular "],[" momentum flows inward "],[" through the shell "])","],[(3b)(dS_(0))/(d tau)=-(dU^(alpha))/(d tau)S_(alpha)=((" change required to keep "S" orthogonal to "U;)/(" "Fermi-Walker-transport law"; cf. "§§6.5,13.6)).]:}\begin{gather*} \frac{d S_{i}}{d \tau}=-\int_{S}\left(\epsilon_{i j k} x^{j} T^{k \ell}\right) n_{\ell} d(\text { area })=\left(\begin{array}{l} \text { rate at which angular } \\ \text { momentum flows inward } \\ \text { through the shell } \end{array}\right), \tag{3a}\\ \frac{d S_{0}}{d \tau}=-\frac{d U^{\alpha}}{d \tau} S_{\alpha}=\binom{\text { change required to keep } \boldsymbol{S} \text { orthogonal to } \boldsymbol{U} ;}{\text { "Fermi-Walker-transport law"; cf. } \S \S 6.5,13.6} . \tag{3b} \end{gather*}(3a)dSidτ=S(ϵijkxjTk)nd( area )=( rate at which angular  momentum flows inward  through the shell ),(3b)dS0dτ=dUαdτSα=( change required to keep S orthogonal to U; "Fermi-Walker-transport law"; cf. §§6.5,13.6).
In these conservation laws T α β T α β T^(alpha beta)T^{\alpha \beta}Tαβ is the total stress-energy tensor at the shell, including contributions from matter, electromagnetic fields, and gravitational waves. The gravitational-wave contribution, called T ( G W ) α β T ( G W ) α β T^((GW)alpha beta)T^{(G W) \alpha \beta}T(GW)αβ, is treated in Chapter 35 .
Note: The conservation laws in the form stated above contain fractional errors of order M / r M / r M//rM / rM/r (contributions from "gravitational potential energy" of infalling material), but such errors go to zero in the limit of a very large shell ( r ) ( r ) (r longrightarrow oo)(r \longrightarrow \infty)(r).
Note: The formulation of these conservation laws given in the next chapter is more precise and more rigorous, but less physically enlightening than the one here.

EXERCISE

Exercise 19.3. GRAVITATIONAL FIELD FAR FROM A STATIONARY, FULLY RELATIVISTIC SOURCE

Derive the line element (19.13) for the special case of a source that is time-independent ( g μ v , t = 0 ) g μ v , t = 0 (g_(mu v,t)=0)\left(g_{\mu v, t}=0\right)(gμv,t=0). This can be a difficult problem, if one does not proceed carefully along the following outlined route. (1) Initially ignore all nonlinearities in the Einstein field equations. The field is weak far from the source. These nonlinearities will be absent from the dominant terms. (2) Calculate the dominant terms using linearized theory in the Lorentz gauge [equations (18.8)]. (3) In particular, write the general solution to the vacuum, time-independent wave equation (18.8b) in the following form involving n j x j / r n j x j / r n^(j)-=x^(j)//r-=n^{j} \equiv x^{j} / r \equivnjxj/r (unit vector in radial direction):
h ¯ 00 = A 0 r + B j n j r 2 + 0 ( 1 r 3 ) h ¯ 0 j = A j r + B j k n k r 2 + 0 ( 1 r 3 ) (19.17) h ¯ j k = A j k r + B j k i n r 2 + 0 ( 1 r 3 ) A j k = A ( j k ) , B j k = B ( j k ) h ¯ 00 = A 0 r + B j n j r 2 + 0 1 r 3 h ¯ 0 j = A j r + B j k n k r 2 + 0 1 r 3 (19.17) h ¯ j k = A j k r + B j k i n r 2 + 0 1 r 3 A j k = A ( j k ) , B j k = B ( j k ) {:[ bar(h)_(00)=(A^(0))/(r)+(B^(j)n^(j))/(r^(2))+0((1)/(r^(3)))],[ bar(h)_(0j)=(A^(j))/(r)+(B^(jk)n^(k))/(r^(2))+0((1)/(r^(3)))],[(19.17) bar(h)_(jk)=(A^(jk))/(r)+(B^(jki)n^(ℓ))/(r^(2))+0((1)/(r^(3)))],[A^(jk)=A^((jk))","quadB^(jkℓ)=B^((jk)ℓ)]:}\begin{align*} & \bar{h}_{00}=\frac{A^{0}}{r}+\frac{B^{j} n^{j}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) \\ & \bar{h}_{0 j}=\frac{A^{j}}{r}+\frac{B^{j k} n^{k}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) \\ & \bar{h}_{j k}=\frac{A^{j k}}{r}+\frac{B^{j k i} n^{\ell}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) \tag{19.17}\\ & A^{j k}=A^{(j k)}, \quad B^{j k \ell}=B^{(j k) \ell} \end{align*}h¯00=A0r+Bjnjr2+0(1r3)h¯0j=Ajr+Bjknkr2+0(1r3)(19.17)h¯jk=Ajkr+Bjkinr2+0(1r3)Ajk=A(jk),Bjk=B(jk)
(Round brackets denote symmetrization.) (4) Then impose the Lorentz gauge conditions h ¯ α β , β = 0 h ¯ α β , β = 0 bar(h)_(alpha)^(beta)_(,beta)=0\bar{h}_{\alpha}{ }^{\beta}{ }_{, \beta}=0h¯αβ,β=0 on this general solution, thereby learning
A j = 0 , A j k = 0 (19.18) B j k ( δ j k 3 n j n k ) = 0 B j k ! ( δ k t 3 n k n t ) = 0 A j = 0 , A j k = 0 (19.18) B j k δ j k 3 n j n k = 0 B j k ! δ k t 3 n k n t = 0 {:[A^(j)=0","quadA^(jk)=0],[(19.18)B^(jk)(delta^(jk)-3n^(j)n^(k))=0],[B^(jk!)(delta^(kt)-3n^(k)n^(t))=0]:}\begin{align*} A^{j}=0, \quad A^{j k} & =0 \\ B^{j k}\left(\delta^{j k}-3 n^{j} n^{k}\right) & =0 \tag{19.18}\\ B^{j k!}\left(\delta^{k t}-3 n^{k} n^{t}\right) & =0 \end{align*}Aj=0,Ajk=0(19.18)Bjk(δjk3njnk)=0Bjk!(δkt3nknt)=0
(5) Write B j k B j k B^(jk)B^{j k}Bjk as the sum of its trace 3 B 3 B 3B3 B3B, its traceless symmetric part S j k S j k S^(jk)S^{j k}Sjk, and its traceless antisymmetric part (these are its "irreducible parts"):
(19.19) B j k = B δ j k + S j k + ϵ i k F , S j j = 0 . (19.19) B j k = B δ j k + S j k + ϵ i k F , S j j = 0 . {:(19.19)B^(jk)=Bdelta^(jk)+S^(jk)+epsilon^(ikℓ)F^(ℓ)","quadS^(jj)=0.:}\begin{equation*} B^{j k}=B \delta^{j k}+S^{j k}+\epsilon^{i k \ell} F^{\ell}, \quad S^{j j}=0 . \tag{19.19} \end{equation*}(19.19)Bjk=Bδjk+Sjk+ϵikF,Sjj=0.
Show that any tensor B j k B j k B^(jk)B^{j k}Bjk can be put into such a form. Then show that the gauge conditions (19.18) imply S j k = 0 S j k = 0 S^(jk)=0S^{j k}=0Sjk=0. (6) Similarly show that any tensor B j k B j k B^(jkℓ)B^{j k \ell}Bjk that is symmetric on its first two indices can be put into the form
(19.20) B j k = δ j k A + C ( j δ k ) + ϵ m ( j E k ) m + S j k (19.20) B j k = δ j k A + C ( j δ k ) + ϵ m ( j E k ) m + S j k {:(19.20)B^(jkℓ)=delta^(jk)A^(ℓ)+C^((j)delta^(k)ℓ)+epsilon^(mℓ(j)E^(k)m)+S^(jkℓ):}\begin{equation*} B^{j k \ell}=\delta^{j k} A^{\ell}+C^{(j} \delta^{k) \ell}+\epsilon^{m \ell(j} E^{k) m}+S^{j k \ell} \tag{19.20} \end{equation*}(19.20)Bjk=δjkA+C(jδk)+ϵm(jEk)m+Sjk
E k m E k m E^(km)E^{k m}Ekm symmetric and traceless, i.e., E k m = E ( k m ) , E k k = 0 E k m = E ( k m ) , E k k = 0 quadE^(km)=E^((km)),quadE^(kk)=0\quad E^{k m}=E^{(k m)}, \quad E^{k k}=0Ekm=E(km),Ekk=0, S j k S j k S^(jkℓ)S^{j k \ell}Sjk symmetric and traceless, i.e., S j k = S ( j k ) S j k = S ( j k ) quadS^(jkℓ)=S^((jkℓ))\quad S^{j k \ell}=S^{(j k \ell)}Sjk=S(jk),
S j j = S j k k = S j k j = 0 S j j = S j k k = S j k j = 0 S^(jjℓ)=S^(jkk)=S^(jkj)=0S^{j j \ell}=S^{j k k}=S^{j k j}=0Sjj=Sjkk=Sjkj=0
Then show that the gauge conditions (19.18) imply C j = 2 A j C j = 2 A j C^(j)=-2A^(j)C^{j}=-2 A^{j}Cj=2Aj and E k m = S j k t = 0 E k m = S j k t = 0 E^(km)=S^(jkt)=0E^{k m}=S^{j k t}=0Ekm=Sjkt=0. (7) Combining all these results, conclude that
h ¯ 00 = A 0 r + B j n j r 2 + 0 ( 1 r 3 ) , (19.21) h ¯ 0 j = ϵ j k i n k F i r 2 + B n j r 2 + 0 ( 1 r 3 ) , h ¯ j k = δ j k A i n i A j n k A k n j r 2 + 0 ( 1 r 3 ) . h ¯ 00 = A 0 r + B j n j r 2 + 0 1 r 3 , (19.21) h ¯ 0 j = ϵ j k i n k F i r 2 + B n j r 2 + 0 1 r 3 , h ¯ j k = δ j k A i n i A j n k A k n j r 2 + 0 1 r 3 . {:[ bar(h)_(00)=(A^(0))/(r)+(B^(j)n^(j))/(r^(2))+0((1)/(r^(3)))","],[(19.21) bar(h)_(0j)=(epsilon^(jki)n^(k)F^(i))/(r^(2))+(Bn^(j))/(r^(2))+0((1)/(r^(3)))","],[ bar(h)_(jk)=(delta^(jk)A^(i)n^(i)-A^(j)n^(k)-A^(k)n^(j))/(r^(2))+0((1)/(r^(3))).]:}\begin{align*} & \bar{h}_{00}=\frac{A^{0}}{r}+\frac{B^{j} n^{j}}{r^{2}}+0\left(\frac{1}{r^{3}}\right), \\ & \bar{h}_{0 j}=\frac{\epsilon^{j k i} n^{k} F^{i}}{r^{2}}+\frac{B n^{j}}{r^{2}}+0\left(\frac{1}{r^{3}}\right), \tag{19.21}\\ & \bar{h}_{j k}=\frac{\delta^{j k} A^{i} n^{i}-A^{j} n^{k}-A^{k} n^{j}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) . \end{align*}h¯00=A0r+Bjnjr2+0(1r3),(19.21)h¯0j=ϵjkinkFir2+Bnjr2+0(1r3),h¯jk=δjkAiniAjnkAknjr2+0(1r3).
Then use gauge transformations, which stay within Lorentz gauge, to eliminate B B BBB and A j A j A^(j)A^{j}Aj from h ¯ 0 j h ¯ 0 j bar(h)_(0j)\bar{h}_{0 j}h¯0j and h ¯ j k h ¯ j k bar(h)_(jk)\bar{h}_{j k}h¯jk; so
h ¯ 00 = A 0 r + ( B j + A j ) n j r 2 + 0 ( 1 r 3 ) (19.22) h ¯ 0 j = ϵ j k n k F r 2 + 0 ( 1 r 3 ) h ¯ j k = 0 ( 1 r 3 ) h ¯ 00 = A 0 r + B j + A j n j r 2 + 0 1 r 3 (19.22) h ¯ 0 j = ϵ j k n k F r 2 + 0 1 r 3 h ¯ j k = 0 1 r 3 {:[ bar(h)_(00)=(A^(0))/(r)+((B^(j)+A^(j))n^(j))/(r^(2))+0((1)/(r^(3)))],[(19.22) bar(h)_(0j)=(epsilon^(jkℓ)n^(k)F^(ℓ))/(r^(2))+0((1)/(r^(3)))],[ bar(h)_(jk)=0((1)/(r^(3)))]:}\begin{align*} & \bar{h}_{00}=\frac{A^{0}}{r}+\frac{\left(B^{j}+A^{j}\right) n^{j}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) \\ & \bar{h}_{0 j}=\frac{\epsilon^{j k \ell} n^{k} F^{\ell}}{r^{2}}+0\left(\frac{1}{r^{3}}\right) \tag{19.22}\\ & \bar{h}_{j k}=0\left(\frac{1}{r^{3}}\right) \end{align*}h¯00=A0r+(Bj+Aj)njr2+0(1r3)(19.22)h¯0j=ϵjknkFr2+0(1r3)h¯jk=0(1r3)
(8) Translate the origin of coordinates so x j new = x j old ( B j + A j ) / A 0 x j new  = x j old  B j + A j / A 0 x^(j)_("new ")=x^(j)_("old ")-(B^(j)+A^(j))//A^(0)x^{j}{ }_{\text {new }}=x^{j}{ }_{\text {old }}-\left(B^{j}+A^{j}\right) / A^{0}xjnew =xjold (Bj+Aj)/A0; in the new coordinate system h ¯ α β h ¯ α β bar(h)_(alpha beta)\bar{h}_{\alpha \beta}h¯αβ has the same form as (19.22), but with B j + A j B j + A j B^(j)+A^(j)B^{j}+A^{j}Bj+Aj removed. From the resultant h ¯ α β h ¯ α β bar(h)_(alpha beta)\bar{h}_{\alpha \beta}h¯αβ, construct the metric and redefine the constants A 0 A 0 A^(0)A^{0}A0 and F F F^(ℓ)F^{\ell}F to agree with equation (19.13). (9) All linear terms in the metric are now accounted for. The dominant nonlinear terms must be proportional to the square, ( M / r ) 2 ( M / r ) 2 (M//r)^(2)(M / r)^{2}(M/r)2, of the dominant linear term. The easiest way to get the proportionality constant is to take the Schwarzschild geometry for a fully relativistic, static, spherical source [equation (31.1)], by a change of coordinates put it in the form
(19.23) d s 2 = ( 1 M / 2 r 1 + M / 2 r ) 2 d t 2 + ( 1 + M 2 r ) 4 ( d x 2 + d y 2 + d z 2 ) (19.23) d s 2 = 1 M / 2 r 1 + M / 2 r 2 d t 2 + 1 + M 2 r 4 d x 2 + d y 2 + d z 2 {:(19.23)ds^(2)=-((1-M//2r)/(1+M//2r))^(2)dt^(2)+(1+(M)/(2r))^(4)(dx^(2)+dy^(2)+dz^(2)):}\begin{equation*} d s^{2}=-\left(\frac{1-M / 2 r}{1+M / 2 r}\right)^{2} d t^{2}+\left(1+\frac{M}{2 r}\right)^{4}\left(d x^{2}+d y^{2}+d z^{2}\right) \tag{19.23} \end{equation*}(19.23)ds2=(1M/2r1+M/2r)2dt2+(1+M2r)4(dx2+dy2+dz2)
(exercise 25.8), and expand it in powers of M / r M / r M//rM / rM/r.

§19.4. MASS AND ANGULAR MOMENTUM OF a Closed universe

"There are no snakes in Ireland."

Statement of St. Patrick after driving the snakes out of Ireland (legend")
There is no such thing as "the energy (or angular momentum, or charge) of a closed universe," according to general relativity, and this for a simple reason. To weigh something one needs a platform on which to stand to do the weighing.
To weigh the sun, one measures the periods and semimajor axes of planetary orbits, and applies Kepler's "1-2-3" law, M = ω 2 a 3 M = ω 2 a 3 M=omega^(2)a^(3)M=\omega^{2} a^{3}M=ω2a3. To measure the angular momentum, S S S\boldsymbol{S}S, of the sun (a task for space technology in the 1970's or 1980's!), one measures the precession of a gyroscope in a near orbit about the sun, or one examines some other aspect of the "dragging of inertial frames." To determine the electric charge
For a closed universe the total mass-energy M M MMM and angular momentum S S SSS are undefined and undefinable

of a body, one surrounds it by a large sphere, evaluates the electric field normal to the surface at each point on this sphere, integrates over the sphere, and applies the theorem of Gauss. But within any closed model universe with the topology of a 3-sphere, a Gaussian 2-sphere that is expanded widely enough from one point finds itself collapsing to nothingness at the antipodal point. Also collapsed to nothingness is the attempt to acquire useful information about the "charge of the universe": the charge is trivially zero. By the same token, every "surface integral" (see details in Chapter 20) to determine mass-energy or angular momentum collapses to nothingness. To make the same point in another way: around a closed universe there is no place to put a test object or gyroscope into Keplerian orbit to determine either any so-called "total mass" or "rest frame" or "4-momentum" or "angular momentum" of the system. These terms are undefined and undefinable. Words, yes; meaning, no.
Not having a defined 4 -momentum for a closed universe may seem at first sight disturbing; but it would be far more disturbing to be given four numbers and to be told authoritatively that they represent the components of some purported "total energy-momentum 4 -vector of the universe." Components with respect to what local Lorentz frame? At what point? And what about the change in this vector on parallel transport around a closed path leading back to that strangely preferred point? It is a happy salvation from these embarrassments that the issue does not and cannot arise!
Imagine a fantastically precise measurement of the energy of a γ γ gamma\gammaγ-ray. The experimenter wishes to know how much this γ γ gamma\gammaγ-ray contributes to the total mass-energy of the universe. Having measured its energy in the laboratory, he then corrects it for the negative gravitational energy by which it is bound to the Earth. The result,
E corrected = h ν ( 1 M / R ) , E corrected  = h ν 1 M / R , E_("corrected ")=h nu(1-M_(o+)//R_(o+)),E_{\text {corrected }}=h \nu\left(1-M_{\oplus} / R_{\oplus}\right),Ecorrected =hν(1M/R),
is the energy the photon will have after it climbs out of the Earth's gravitational field. But this is only the first in a long chain of corrections for energy losses (redshifts) as the photon climbs out of the gravitational fields of the solar system, the galaxy, the local cluster of galaxies, the supercluster, and then what? These corrections show no sign of converging, unless to E corrected = 0 E corrected  = 0 E_("corrected ")=0E_{\text {corrected }}=0Ecorrected =0.
Quite in contrast to the charge-energy-angular-momentum facelessness of a closed universe are the attractive possibilities of defining and measuring all three quantities in any space that is asymptotically flat. One does not have to revolutionize presentday views of cosmology to talk of asymptotically flat space. It is enough to note how small is the departure from flatness, as measured by the departure of ( g 00 ) 1 / 2 g 00 1 / 2 (-g_(00))^(1//2)\left(-g_{00}\right)^{1 / 2}(g00)1/2 from unity, in cases of astronomical or astrophysical interest (Box 19.2). Surrounding a region where any dynamics, however complicated, is going on, whenever the geometry is asymptotically flat to some specified degree of precision, then to that degree of precision it makes sense to speak of the total energy-momentum 4 -vector of the dynamic region, P P P\boldsymbol{P}P, and its total intrinsic angular momentum, S S S\boldsymbol{S}S. Parallel transport of either around any closed curve in the flat region brings it back to its
Box 19.2 METRIC CORRECTION TERM NEAR SELECTED HEAVENLY BODIES
m m mmm m m mmm r r rrr m r = 1 ( g 00 ) 1 / 2 m r = 1 g 00 1 / 2 (m)/(r)=1-(-g_(00))^(1//2)\frac{m}{r}=1-\left(-g_{00}\right)^{1 / 2}mr=1(g00)1/2
At shoulder of Venus de Milo 2 × 10 5 g = 2 × 10 5 g = 2xx10^(5)g=2 \times 10^{5} \mathrm{~g}=2×105 g= 1.5 × 10 23 cm 1.5 × 10 23 cm 1.5 xx10^(-23)cm1.5 \times 10^{-23} \mathrm{~cm}1.5×1023 cm 30 cm 5 × 10 25 5 × 10 25 5xx10^(-25)5 \times 10^{-25}5×1025
At surface of Earth 6 × 10 27 g = 6 × 10 27 g = 6xx10^(27)g=6 \times 10^{27} \mathrm{~g}=6×1027 g= 4 × 10 1 cm 4 × 10 1 cm 4xx10^(-1)cm4 \times 10^{-1} \mathrm{~cm}4×101 cm 6.4 × 10 8 cm 6.4 × 10 8 cm 6.4 xx10^(8)cm6.4 \times 10^{8} \mathrm{~cm}6.4×108 cm 6 × 10 10 6 × 10 10 6xx10^(-10)6 \times 10^{-10}6×1010
At Earth's distance from sun 2 × 10 33 g = 2 × 10 33 g = 2xx10^(33)g=2 \times 10^{33} \mathrm{~g}=2×1033 g= 1.5 × 10 5 cm 1.5 × 10 5 cm 1.5 xx10^(5)cm1.5 \times 10^{5} \mathrm{~cm}1.5×105 cm 1.5 × 10 13 cm 1.5 × 10 13 cm 1.5 xx10^(13)cm1.5 \times 10^{13} \mathrm{~cm}1.5×1013 cm 1 × 10 8 1 × 10 8 1xx10^(-8)1 \times 10^{-8}1×108
At sun's distance from center of galaxy 2 × 10 44 g = 2 × 10 44 g = 2xx10^(44)g=2 \times 10^{44} \mathrm{~g}=2×1044 g= 1.5 × 10 16 cm 1.5 × 10 16 cm 1.5 xx10^(16)cm1.5 \times 10^{16} \mathrm{~cm}1.5×1016 cm 2.5 × 10 22 cm 2.5 × 10 22 cm 2.5 xx10^(22)cm2.5 \times 10^{22} \mathrm{~cm}2.5×1022 cm 6 × 10 7 6 × 10 7 6xx10^(-7)6 \times 10^{-7}6×107
At distance of galaxy from center of Virgo cluster of galaxies 6 × 10 47 g = 6 × 10 47 g = 6xx10^(47)g=6 \times 10^{47} \mathrm{~g}=6×1047 g= 4 × 10 19 cm 4 × 10 19 cm 4xx10^(19)cm4 \times 10^{19} \mathrm{~cm}4×1019 cm 3 × 10 25 cm 3 × 10 25 cm 3xx10^(25)cm3 \times 10^{25} \mathrm{~cm}3×1025 cm 1 × 10 6 1 × 10 6 1xx10^(-6)1 \times 10^{-6}1×106
m m r (m)/(r)=1-(-g_(00))^(1//2) At shoulder of Venus de Milo 2xx10^(5)g= 1.5 xx10^(-23)cm 30 cm 5xx10^(-25) At surface of Earth 6xx10^(27)g= 4xx10^(-1)cm 6.4 xx10^(8)cm 6xx10^(-10) At Earth's distance from sun 2xx10^(33)g= 1.5 xx10^(5)cm 1.5 xx10^(13)cm 1xx10^(-8) At sun's distance from center of galaxy 2xx10^(44)g= 1.5 xx10^(16)cm 2.5 xx10^(22)cm 6xx10^(-7) At distance of galaxy from center of Virgo cluster of galaxies 6xx10^(47)g= 4xx10^(19)cm 3xx10^(25)cm 1xx10^(-6)| | $m$ | $m$ | $r$ | $\frac{m}{r}=1-\left(-g_{00}\right)^{1 / 2}$ | | :---: | :---: | :---: | :---: | :---: | | At shoulder of Venus de Milo | $2 \times 10^{5} \mathrm{~g}=$ | $1.5 \times 10^{-23} \mathrm{~cm}$ | 30 cm | $5 \times 10^{-25}$ | | At surface of Earth | $6 \times 10^{27} \mathrm{~g}=$ | $4 \times 10^{-1} \mathrm{~cm}$ | $6.4 \times 10^{8} \mathrm{~cm}$ | $6 \times 10^{-10}$ | | At Earth's distance from sun | $2 \times 10^{33} \mathrm{~g}=$ | $1.5 \times 10^{5} \mathrm{~cm}$ | $1.5 \times 10^{13} \mathrm{~cm}$ | $1 \times 10^{-8}$ | | At sun's distance from center of galaxy | $2 \times 10^{44} \mathrm{~g}=$ | $1.5 \times 10^{16} \mathrm{~cm}$ | $2.5 \times 10^{22} \mathrm{~cm}$ | $6 \times 10^{-7}$ | | At distance of galaxy from center of Virgo cluster of galaxies | $6 \times 10^{47} \mathrm{~g}=$ | $4 \times 10^{19} \mathrm{~cm}$ | $3 \times 10^{25} \mathrm{~cm}$ | $1 \times 10^{-6}$ |
starting point unchanged. Moreover, it makes no difference how enormous are the departures from flatness in the dynamic region (black holes, collapsing stars, intense gravitational waves, etc.); far away the curvature will be weak, and the 4 -momentum and angular momentum will reveal themselves by their imprints on the spacetime geometry.

CONSERVATION LAWS FOR 4-MOMENTUM AND ANGULAR MOMENTUM

We denote as energy of a material system in a certain state the contribution of all effects (measured in mechanical units of work) produced outside the system when it passes in an arbitrary manner from its state to a reference state which has been defined ad hoc.
WILLIAM THOMPSON (later Lord Kelvin), as quoted by Max von Laue in Schilpp (1949), p. 514.
All forms of energy possess inertia.
ALBERT EINSTEIN, conclusion
from his paper of September 26, 1905,
as summarized by von Laue in Schilpp (1949), p. 523.
Chapter 5 (stress-energy tensor) is needed as preparation for this chapter, which in turn is needed as preparation for the Track-2 portion of Chapter 36 (generation of gravitational waves) and will be useful in understanding Chapter 35 (propagation of gravitational waves).

§20.1. OVERVIEW

Chapter 19 expounded the key features of total 4-momentum P P P\boldsymbol{P}P and total angular momentum S S S\boldsymbol{S}S for an arbitrary, gravitating system. But one crucial feature was left unproved: the conservation laws for P P P\boldsymbol{P}P and S S S\boldsymbol{S}S (Box 19.1). To prove those conservation laws is the chief purpose of this chapter. But other interesting, if less important, aspects of P P P\boldsymbol{P}P and S S S\boldsymbol{S}S will be encountered along the route to the proof-Gaussian flux integrals for 4-momentum and angular momentum; a stress-energy "pseudotensor" for the gravitational field, which is a tool in constructing volume integrals for P P P\boldsymbol{P}P and S S S\boldsymbol{S}S; and the nonlocalizability of the energy of the gravitational field.

§20.2. GAUSSIAN FLUX INTEGRALS FOR 4-MOMENTUM AND ANGULAR MOMENTUM

In electromagnetic theory one can determine the conserved total charge of a source by adding up the number of electric field lines emanating from it-i.e., by performing a Gaussian flux integral over a closed 2-surface surrounding it:
(20.1) Q = 1 4 π E j d 2 S j = 1 4 π F 0 j d 2 S j (20.1) Q = 1 4 π E j d 2 S j = 1 4 π F 0 j d 2 S j {:(20.1)Q=(1)/(4pi)ointE^(j)d^(2)S_(j)=(1)/(4pi)ointF^(0j)d^(2)S_(j):}\begin{equation*} Q=\frac{1}{4 \pi} \oint E^{j} d^{2} S_{j}=\frac{1}{4 \pi} \oint F^{0 j} d^{2} S_{j} \tag{20.1} \end{equation*}(20.1)Q=14πEjd2Sj=14πF0jd2Sj
Similarly, in Newtonian theory one can determine the mass of a source by evaluating the Gaussian flux integral
(20.2) M = 1 4 π Φ , j d 2 S j . (20.2) M = 1 4 π Φ , j d 2 S j . {:(20.2)M=(1)/(4pi)ointPhi_(,j)d^(2)S_(j).:}\begin{equation*} M=\frac{1}{4 \pi} \oint \Phi_{, j} d^{2} S_{j} . \tag{20.2} \end{equation*}(20.2)M=14πΦ,jd2Sj.
These flux integrals work because the charge and mass of a source place indelible imprints on the electromagnetic and gravitational fields that envelop it.
The external gravitational field (spacetime geometry) in general relativity possesses similar imprints, imprints not only of the source's total mass-energy M M MMM, but also of its total 4-momentum P P P\boldsymbol{P}P and its intrinsic angular momentum S S S\boldsymbol{S}S (see Box 19.1). Hence, it is reasonable to search for Gaussian flux integrals that represent the 4-momentum and angular momentum of the source.
To simplify the search, carry it out initially in linearized theory, and use Maxwell electrodynamics as a guide. In electrodynamics the Gaussian flux integral for charge follows from Maxwell's equations F μ ν , ν = 4 π J μ F μ ν , ν = 4 π J μ F^(mu nu)_(,nu)=4piJ^(mu)F^{\mu \nu}{ }_{, \nu}=4 \pi J^{\mu}Fμν,ν=4πJμ, plus the crucial fact that F μ ν F μ ν F^(mu nu)F^{\mu \nu}Fμν is antisymmetric, so that F , μ 0 μ = F , j 0 j F , μ 0 μ = F , j 0 j F_(,mu)^(0mu)=F_(,j)^(0j)F_{, \mu}^{0 \mu}=F_{, j}^{0 j}F,μ0μ=F,j0j :
Q = J 0 d 3 x = 1 4 π F 0 , ν d 3 x = 1 4 π F 0 j , j d 3 x = 1 4 π F 0 j d 2 S j . [ Gauss's theorem] Q = J 0 d 3 x = 1 4 π F 0 , ν d 3 x = 1 4 π F 0 j , j d 3 x = 1 4 π F 0 j d 2 S j . [  Gauss's theorem]  {:[Q=intJ^(0)d^(3)x=(1)/(4pi)intF^(0)_(,nu)d^(3)x=(1)/(4pi)intF^(0j)_(,j)d^(3)x=(1)/(4pi)ointF^(0j)d^(2)S_(j).],[[" Gauss's theorem] "]:}\begin{gathered} Q=\int J^{0} d^{3} x=\frac{1}{4 \pi} \int F^{0}{ }_{, \nu} d^{3} x=\frac{1}{4 \pi} \int F^{0 j}{ }_{, j} d^{3} x=\frac{1}{4 \pi} \oint F^{0 j} d^{2} S_{j} . \\ {[\text { Gauss's theorem] }} \end{gathered}Q=J0d3x=14πF0,νd3x=14πF0j,jd3x=14πF0jd2Sj.[ Gauss's theorem] 
To find analogous flux integrals in linearized theory, rewrite the linearized field equations (18.7) in an analogous form involving an entity with analogous crucial symmetries. The entity needed turns out to be
(20.3) H μ α ν β ( h ¯ μ ν η α β + η μ ν h ¯ α β h ¯ α ν η μ β h ¯ μ β η α ν ) (20.3) H μ α ν β h ¯ μ ν η α β + η μ ν h ¯ α β h ¯ α ν η μ β h ¯ μ β η α ν {:(20.3)H^(mu alpha nu beta)-=-( bar(h)^(mu nu)eta^(alpha beta)+eta^(mu nu) bar(h)^(alpha beta)- bar(h)^(alpha nu)eta^(mu beta)- bar(h)^(mu beta)eta^(alpha nu)):}\begin{equation*} H^{\mu \alpha \nu \beta} \equiv-\left(\bar{h}^{\mu \nu} \eta^{\alpha \beta}+\eta^{\mu \nu} \bar{h}^{\alpha \beta}-\bar{h}^{\alpha \nu} \eta^{\mu \beta}-\bar{h}^{\mu \beta} \eta^{\alpha \nu}\right) \tag{20.3} \end{equation*}(20.3)Hμανβ(h¯μνηαβ+ημνh¯αβh¯ανημβh¯μβηαν)
As one readily verifies from this expression, it has the same symmetries as the Riemann tensor
H μ α ν β = H ν β μ α = H [ μ α ] [ ν β ] , (20.4) H μ [ α ν β ] = 0 . H μ α ν β = H ν β μ α = H [ μ α ] [ ν β ] , (20.4) H μ [ α ν β ] = 0 . {:[H^(mu alpha nu beta)=H^(nu beta mu alpha)=H^([mu alpha][nu beta])","],[(20.4)H^(mu[alpha nu beta])=0.]:}\begin{gather*} H^{\mu \alpha \nu \beta}=H^{\nu \beta \mu \alpha}=H^{[\mu \alpha][\nu \beta]}, \\ H^{\mu[\alpha \nu \beta]}=0 . \tag{20.4} \end{gather*}Hμανβ=Hνβμα=H[μα][νβ],(20.4)Hμ[ανβ]=0.
This entity, like h ¯ μ ν h ¯ μ ν bar(h)^(mu nu)\bar{h}^{\mu \nu}h¯μν, transforms as a tensor under the Lorentz transformations of linearized theory; but it is not gauge-invariant, so it is not a tensor in the general relativistic sense.
Gaussian flux integrals for charge and Newtonian mass
Linearized field equations in terms of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ
Gaussian flux integrals in linearized theory: (1) for 4-momentum
(2) for angular momentum
Generalization of Gaussian flux integrals to full general relativity
In terms of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ, the linearized field equations (18.7) take on the much simplified form
(20.5) 2 G μ ν = H , α β μ α ν β = 16 π T μ ν (20.5) 2 G μ ν = H , α β μ α ν β = 16 π T μ ν {:(20.5)2G^(mu nu)=H_(,alpha beta)^(mu alpha nu beta)=16 piT^(mu nu):}\begin{equation*} 2 G^{\mu \nu}=H_{, \alpha \beta}^{\mu \alpha \nu \beta}=16 \pi T^{\mu \nu} \tag{20.5} \end{equation*}(20.5)2Gμν=H,αβμανβ=16πTμν
and from these, by antisymmetry of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ in ν ν nu\nuν and β β beta\betaβ, follow the source conservation laws of linearized theory,
T , ν μ ν = 1 16 π H , α β ν μ α ν β = 0 T , ν μ ν = 1 16 π H , α β ν μ α ν β = 0 T_(,nu)^(mu nu)=(1)/(16 pi)H_(,alpha beta nu)^(mu alpha nu beta)=0T_{, \nu}^{\mu \nu}=\frac{1}{16 \pi} H_{, \alpha \beta \nu}^{\mu \alpha \nu \beta}=0T,νμν=116πH,αβνμανβ=0
which were discussed back in §18.3. The same antisymmetry as yields these equations of motion also produces a Gaussian flux integral for the source's total 4-momentum:
P μ = T μ 0 d 3 x = 1 16 π H , α β μ α 0 β d 3 x = 1 16 π H , α j μ α 0 j d 3 x (20.6) = 1 16 π S H μ α 0 j , α d 2 S j . [Gauss's theorem] P μ = T μ 0 d 3 x = 1 16 π H , α β μ α 0 β d 3 x = 1 16 π H , α j μ α 0 j d 3 x (20.6) = 1 16 π S H μ α 0 j , α d 2 S j .  [Gauss's theorem]  {:[P^(mu)=intT^(mu0)d^(3)x=(1)/(16 pi)intH_(,alpha beta)^(mu alpha0beta)d^(3)x=(1)/(16 pi)intH_(,alpha j)^(mu alpha0j)d^(3)x],[(20.6)=(1)/(16 pi)oint_(S)H^(mu alpha0j)_(,alpha)d^(2)S_(j).],[" [Gauss's theorem] "]:}\begin{align*} P^{\mu} & =\int T^{\mu 0} d^{3} x=\frac{1}{16 \pi} \int H_{, \alpha \beta}^{\mu \alpha 0 \beta} d^{3} x=\frac{1}{16 \pi} \int H_{, \alpha j}^{\mu \alpha 0 j} d^{3} x \\ & =\frac{1}{16 \pi} \oint_{S} H^{\mu \alpha 0 j}{ }_{, \alpha} d^{2} S_{j} . \tag{20.6}\\ & \text { [Gauss's theorem] } \end{align*}Pμ=Tμ0d3x=116πH,αβμα0βd3x=116πH,αjμα0jd3x(20.6)=116πSHμα0j,αd2Sj. [Gauss's theorem] 
Here the closed 2 -surface of integration S S SSS must completely surround the source and must lie in a 3 -surface of constant time x 0 x 0 x^(0)x^{0}x0. The integral (20.6) for the source's energy P 0 P 0 P^(0)P^{0}P0, which is used more frequently than the integrals for P j P j P^(j)P^{j}Pj, reduces to an especially simple form in terms of g α β = η α β + h α β g α β = η α β + h α β g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}gαβ=ηαβ+hαβ :
(20.7) P 0 = 1 16 π S ( g j k , k g k k , j ) d 2 S j (20.7) P 0 = 1 16 π S g j k , k g k k , j d 2 S j {:(20.7)P^(0)=(1)/(16 pi)int_(S)(g_(jk,k)-g_(kk,j))d^(2)S_(j):}\begin{equation*} P^{0}=\frac{1}{16 \pi} \int_{S}\left(g_{j k, k}-g_{k k, j}\right) d^{2} S_{j} \tag{20.7} \end{equation*}(20.7)P0=116πS(gjk,kgkk,j)d2Sj
(see exercise 20.1).
A calculation similar to (20.6), but more lengthy (exercise 20.2), yields a flux integral for total angular momentum about the origin of coordinates:
J μ ν = ( x μ T ν 0 x ν T μ 0 ) d 3 x (20.8) = 1 16 π S ( x μ H ν α 0 j , α x ν H μ α 0 j , α + H μ j 0 ν H ν j 0 μ ) d 2 S j . J μ ν = x μ T ν 0 x ν T μ 0 d 3 x (20.8) = 1 16 π S x μ H ν α 0 j , α x ν H μ α 0 j , α + H μ j 0 ν H ν j 0 μ d 2 S j . {:[J^(mu nu)=int(x^(mu)T^(nu0)-x^(nu)T^(mu0))d^(3)x],[(20.8)=(1)/(16 pi)oint_(S)(x^(mu)H^(nu alpha0j)_(,alpha)-x^(nu)H^(mu alpha0j)_(,alpha)+H^(mu j0nu)-H^(nu j0mu))d^(2)S_(j).]:}\begin{align*} J^{\mu \nu} & =\int\left(x^{\mu} T^{\nu 0}-x^{\nu} T^{\mu 0}\right) d^{3} x \\ & =\frac{1}{16 \pi} \oint_{S}\left(x^{\mu} H^{\nu \alpha 0 j}{ }_{, \alpha}-x^{\nu} H^{\mu \alpha 0 j}{ }_{, \alpha}+H^{\mu j 0 \nu}-H^{\nu j 0 \mu}\right) d^{2} S_{j} . \tag{20.8} \end{align*}Jμν=(xμTν0xνTμ0)d3x(20.8)=116πS(xμHνα0j,αxνHμα0j,α+Hμj0νHνj0μ)d2Sj.
To evaluate the flux integrals in (20.6) to (20.8) (by contrast with the volume integrals), one need utilize only the gravitational field far outside the source. Since that gravitational field has the same form in full general relativity for strong sources as in linearized theory for weak sources, the flux integrals can be used to calculate P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν for any isolated source whatsoever, weak or strong:
P μ = 1 16 π S H , α μ α 0 j d 2 S j (20.9) P 0 = 1 16 π S ( g j k , k g k k , j ) d 2 S j J μ ν = 1 16 π S ( x μ H ν α 0 j , α x ν H μ α 0 j , α + H μ j 0 ν H v j 0 μ ) d 2 S j P μ = 1 16 π S H , α μ α 0 j d 2 S j (20.9) P 0 = 1 16 π S g j k , k g k k , j d 2 S j J μ ν = 1 16 π S x μ H ν α 0 j , α x ν H μ α 0 j , α + H μ j 0 ν H v j 0 μ d 2 S j {:[P^(mu)=(1)/(16 pi)oint_(S)H_(,alpha)^(mu alpha0j)d^(2)S_(j)],[(20.9)P^(0)=(1)/(16 pi)oint_(S)(g_(jk,k)-g_(kk,j))d^(2)S_(j)],[J^(mu nu)=(1)/(16 pi)oint_(S)(x^(mu)H^(nu alpha0j)_(,alpha)-x^(nu)H^(mu alpha0j)_(,alpha):}],[{: quad+H^(mu j0nu)-H^(vj0mu))d^(2)S_(j)]:}\begin{align*} P^{\mu}= & \frac{1}{16 \pi} \oint_{S} H_{, \alpha}^{\mu \alpha 0 j} d^{2} S_{j} \\ P^{0}= & \frac{1}{16 \pi} \oint_{S}\left(g_{j k, k}-g_{k k, j}\right) d^{2} S_{j} \tag{20.9}\\ J^{\mu \nu}= & \frac{1}{16 \pi} \oint_{S}\left(x^{\mu} H^{\nu \alpha 0 j}{ }_{, \alpha}-x^{\nu} H^{\mu \alpha 0 j}{ }_{, \alpha}\right. \\ & \left.\quad+H^{\mu j 0 \nu}-H^{v j 0 \mu}\right) d^{2} S_{j} \end{align*}Pμ=116πSH,αμα0jd2Sj(20.9)P0=116πS(gjk,kgkk,j)d2SjJμν=116πS(xμHνα0j,αxνHμα0j,α+Hμj0νHvj0μ)d2Sj
theory, for any isolated source, when the closed surface of integration S S SSS is in the asymptotically flat region surrounding the source, and when asymptotically Minkowskian coordinates are used.
Knowing P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν, one can calculate the source's total mass-energy M M MMM and intrinsic angular momentum S μ S μ S^(mu)S^{\mu}Sμ by the standard procedure of Box 5.6 :
(20.10) M = ( P μ P μ ) 1 / 2 , (20.11) Y μ = J μ ν P ν / M 2 = ( vector by which the source's asymptotic, "M/r", spherical field is displaced from being centered on the origin of coordinates ) (20.12) S ρ = 1 2 ϵ μ ν σ ρ ( J μ ν Y μ P ν + Y ν P μ ) P σ / M . (20.10) M = P μ P μ 1 / 2 , (20.11) Y μ = J μ ν P ν / M 2 =  vector by which the source's asymptotic,   "M/r", spherical field is displaced from   being centered on the origin of coordinates  (20.12) S ρ = 1 2 ϵ μ ν σ ρ J μ ν Y μ P ν + Y ν P μ P σ / M . {:[(20.10)M=(-P^(mu)P_(mu))^(1//2)","],[(20.11)Y^(mu)=-J^(mu nu)P_(nu)//M^(2)=([" vector by which the source's asymptotic, "],[" "M/r", spherical field is displaced from "],[" being centered on the origin of coordinates "])],[(20.12)S_(rho)=(1)/(2)epsilon_(mu nu sigma rho)(J^(mu nu)-Y^(mu)P^(nu)+Y^(nu)P^(mu))P^(sigma)//M.]:}\begin{align*} M & =\left(-P^{\mu} P_{\mu}\right)^{1 / 2}, \tag{20.10}\\ Y^{\mu} & =-J^{\mu \nu} P_{\nu} / M^{2}=\left(\begin{array}{l} \text { vector by which the source's asymptotic, } \\ \text { "M/r", spherical field is displaced from } \\ \text { being centered on the origin of coordinates } \end{array}\right) \tag{20.11}\\ S_{\rho} & =\frac{1}{2} \epsilon_{\mu \nu \sigma \rho}\left(J^{\mu \nu}-Y^{\mu} P^{\nu}+Y^{\nu} P^{\mu}\right) P^{\sigma} / M . \tag{20.12} \end{align*}(20.10)M=(PμPμ)1/2,(20.11)Yμ=JμνPν/M2=( vector by which the source's asymptotic,  "M/r", spherical field is displaced from  being centered on the origin of coordinates )(20.12)Sρ=12ϵμνσρ(JμνYμPν+YνPμ)Pσ/M.
Note especially that the integrands of the flux integrals (20.9) are not gauge-invariant. In any local inertial frame at an event P 0 [ g μ ν ( P 0 ) = η μ ν , g μ ν , α ( P 0 ) = 0 ] P 0 g μ ν P 0 = η μ ν , g μ ν , α P 0 = 0 P_(0)[g_(mu nu)(P_(0))=eta_(mu nu),g_(mu nu,alpha)(P_(0))=0]\mathscr{\mathscr { P }}_{0}\left[g_{\mu \nu}\left(\mathscr{P}_{0}\right)=\eta_{\mu \nu}, g_{\mu \nu, \alpha}\left(\mathscr{P}_{0}\right)=0\right]P0[gμν(P0)=ημν,gμν,α(P0)=0] they vanish, since
g μ ν , α = h μ v , α = 0 H μ ν α β , α = 0 ; g μ ν = η μ ν H μ ν α β = 0 . g μ ν , α = h μ v , α = 0 H μ ν α β , α = 0 ; g μ ν = η μ ν H μ ν α β = 0 . g_(mu nu,alpha)=h_(mu v,alpha)=0=>H^(mu nu alpha beta)_(,alpha)=0;quadg_(mu nu)=eta_(mu nu)=>H^(mu nu alpha beta)=0.g_{\mu \nu, \alpha}=h_{\mu v, \alpha}=0 \Rightarrow H^{\mu \nu \alpha \beta}{ }_{, \alpha}=0 ; \quad g_{\mu \nu}=\eta_{\mu \nu} \Rightarrow H^{\mu \nu \alpha \beta}=0 .gμν,α=hμv,α=0Hμναβ,α=0;gμν=ημνHμναβ=0.
This is reasonable behavior; their Newtonian analog, the integrand Φ j = Φ j = Phi_(j)=\Phi_{j}=Φj= (gravitational acceleration) of the Newtonian flux integral (20.2), similarly vanishes in local inertial frames.
Although the integrands of the flux integrals are not gauge-invariant, the total integrals P μ P μ P^(mu)P^{\mu}Pμ (4-momentum) and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν (angular momentum) most assuredly are! They have meaning and significance independent of any coordinate system and gauge. They are tensors in the asymptotically flat region surrounding the source.
The spacetime must be asymptotically flat if there is to be any possibility of defining energy and angular momentum. Only then can linearized theory be applied; and only on the principle that linearized theory applies far away can one justify using the flux integrals (20.9) in the full nonlinear theory. Nobody can compel a physicist to move in close to define energy and angular momentum. He has no need to move in close; and he may have compelling motives not to: the internal structure of the sources may be inaccessible, incomprehensible, uninteresting, dangerous, expensively distant, or frightening. This requirement for far-away flatness is a remarkable feature of the flux integrals (20.9); it is also a decisive feature. Even the coordinates must be asymptotically Minkowskian; otherwise most formulas in this chapter fail or require modification. In particular, when evaluating the 4-momentum and angular momentum of a localized system, one must apply the flux integrals (20.9) only in asymptotically Minkowskian coordinates. If such coordinates do not exist (spacetime not flat at infinity), one must completely abandon the flux integrals, and the quantities that rely on them for definition: the total mass, momentum, and angular momentum of the gravitating source. In this connection, recall the discussion of $19.4. It described, in physical terms, why "total mass-energy" is a limited concept, useful only when one adopts a limited viewpoint that ignores cosmology. (Compare "light ray" or "particle," concepts of enormous value, but concepts that break down when wave optics or wave mechanics enter significantly.)
Summary: Attempts to use formulas (20.9) in ways that lose sight of the Minkowski boundary conditions (and especially simply adopting them unmodified in curvilinear coordinates) easily and unavoidably produce nonsense.
Total mass-energy, center of mass, and intrinsic angular momentum
Gaussian flux integrals valid only in asymptotically flat region of spacetime and in asymptotically Minkowskian coordinates

EXERCISES

The full Einstein field equations in terms of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ
Volume integrals for 4-momentum and angular momentum in full general relativity

Exercise 20.1. FLUX INTEGRAL FOR TOTAL MASS-ENERGY IN LINEARIZED THEORY

Show that the flux integral (20.6) for P 0 P 0 P^(0)P^{0}P0 reduces to (20.7). Then show that, when applied to a nearly Newtonian source [line element (18.15c)], it reduces further to the familiar Newtonian flux integral (20.2).

Exercise 20.2. FLUX INTEGRAL FOR ANGULAR MOMENTUM IN LINEARIZED THEORY

Derive the Gaussian flux integral (20.8) for J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν. [Hint: use the field equations (20.5) to show
(20.13) 16 π x μ T ν 0 = ( x μ H , α ν α 0 k ) , k H , j ν j 0 μ H , 0 ν 00 μ ; (20.13) 16 π x μ T ν 0 = x μ H , α ν α 0 k , k H , j ν j 0 μ H , 0 ν 00 μ ; {:(20.13)16 pix^(mu)T^(nu0)=(x^(mu)H_(,alpha)^(nu alpha0k))_(,k)-H_(,j)^(nu j0mu)-H_(,0)^(nu00 mu);:}\begin{equation*} 16 \pi x^{\mu} T^{\nu 0}=\left(x^{\mu} H_{, \alpha}^{\nu \alpha 0 k}\right)_{, k}-H_{, j}^{\nu j 0 \mu}-H_{, 0}^{\nu 00 \mu} ; \tag{20.13} \end{equation*}(20.13)16πxμTν0=(xμH,ανα0k),kH,jνj0μH,0ν00μ;
and then use Gauss's theorem to evaluate the volume integral of equation (20.8)].

Exercise 20.3. FLUX INTEGRALS FOR AN ARBITRARY STATIONARY SOURCE

(a) Use the flux integrals (20.9) to calculate P μ P μ P^(mu)P^{\mu}Pμ and J μ v J μ v J^(mu v)J^{\mu v}Jμv for an arbitrary stationary source. For the asymptotically flat metric around the source, use (19.13), with the gravitational radiation terms set to zero.
(b) Verify that the "auxiliary equations" (20.10) to (20.12) give the correct answer for this source's total mass-energy M M MMM and intrinsic angular momentum S μ S μ S^(mu)S^{\mu}Sμ.

§20.3. VOLUME INTEGRALS FOR 4-MOMENTUM AND ANGULAR MOMENTUM

It is easy, in linearized theory, to convert the surface integrals for P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν into volume integrals over the source; one can simply trace backward the steps that led to the surface integrals in the first place [equation (20.6); exercise 20.2]. How, in full general relativity, can one similarly convert from the surface integrals to volume integrals? The answer is rather easy, if one thinks in the right direction. One need only put the full Einstein field equations into the form
(20.14) H , α β μ α ν β = 16 π T eff μ ν (20.14) H , α β μ α ν β = 16 π T eff μ ν {:(20.14)H_(,alpha beta)^(mu alpha nu beta)=16 piT_(eff)^(mu nu):}\begin{equation*} H_{, \alpha \beta}^{\mu \alpha \nu \beta}=16 \pi T_{\mathrm{eff}}^{\mu \nu} \tag{20.14} \end{equation*}(20.14)H,αβμανβ=16πTeffμν
analogous to equations (20.5) of linearized theory. Here H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ is to be defined in terms of h μ ν g μ ν η μ ν h μ ν g μ ν η μ ν h_(mu nu)-=g_(mu nu)-eta_(mu nu)h_{\mu \nu} \equiv g_{\mu \nu}-\eta_{\mu \nu}hμνgμνημν by equation (20.3), even deep inside the source where | h μ ν | h μ ν |h_(mu nu)|\left|h_{\mu \nu}\right||hμν| might be 1 1 ≳1\gtrsim 11. This form of the Einstein equations then permits a conversion of the Gaussian flux integrals into volume integrals, just as in linearized theory:
P μ = 1 16 π H , α μ α 0 j d 2 S j = 1 16 π H , α j μ α 0 j d 3 x = 1 16 π H μ α 0 β , α β d 3 x (20.15) = T eff μ 0 d 3 x P μ = 1 16 π H , α μ α 0 j d 2 S j = 1 16 π H , α j μ α 0 j d 3 x = 1 16 π H μ α 0 β , α β d 3 x (20.15) = T eff  μ 0 d 3 x {:[P^(mu)=(1)/(16 pi)ointH_(,alpha)^(mu alpha0j)d^(2)S_(j)=(1)/(16 pi)intH_(,alpha j)^(mu alpha0j)d^(3)x=(1)/(16 pi)intH^(mu alpha0beta)_(,alpha beta)d^(3)x],[(20.15)=intT_("eff ")^(mu0)d^(3)x]:}\begin{align*} P^{\mu} & =\frac{1}{16 \pi} \oint H_{, \alpha}^{\mu \alpha 0 j} d^{2} S_{j}=\frac{1}{16 \pi} \int H_{, \alpha j}^{\mu \alpha 0 j} d^{3} x=\frac{1}{16 \pi} \int H^{\mu \alpha 0 \beta}{ }_{, \alpha \beta} d^{3} x \\ & =\int T_{\text {eff }}^{\mu 0} d^{3} x \tag{20.15} \end{align*}Pμ=116πH,αμα0jd2Sj=116πH,αjμα0jd3x=116πHμα0β,αβd3x(20.15)=Teff μ0d3x
Similarly,
(20.16) J μ ν = ( x μ T eff ν 0 x ν T eff μ 0 ) d 3 x (20.16) J μ ν = x μ T eff ν 0 x ν T eff μ 0 d 3 x {:(20.16)J^(mu nu)=int(x^(mu)T_(eff)^(nu0)-x^(nu)T_(eff)^(mu0))d^(3)x:}\begin{equation*} J^{\mu \nu}=\int\left(x^{\mu} T_{\mathrm{eff}}^{\nu 0}-x^{\nu} T_{\mathrm{eff}}^{\mu 0}\right) d^{3} x \tag{20.16} \end{equation*}(20.16)Jμν=(xμTeffν0xνTeffμ0)d3x
[Crucial to the conversion is the use of partial derivatives rather than covariant derivatives in equations (20.14).] In these volume integrals, as throughout the preceeding discussion, the coordinates must become asymptotically Lorentz ( g μ ν η μ ν ) g μ ν η μ ν (g_(mu nu)rarreta_(mu nu))\left(g_{\mu \nu} \rightarrow \eta_{\mu \nu}\right)(gμνημν) far from the source.
The form of T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν can be calculated by recalling that H μ α ν β , α β H μ α ν β , α β H^(mu alpha nu beta)_(,alpha beta)H^{\mu \alpha \nu \beta}{ }_{, \alpha \beta}Hμανβ,αβ is a linearized approximation to the Einstein curvature tensor (20.5). Define the nonlinear corrections by
(20.17) 16 π t μ ν H μ α ν β α β 2 G μ ν (20.17) 16 π t μ ν H μ α ν β α β 2 G μ ν {:(20.17)16 pit^(mu nu)-=H^(mu alpha nu beta)_(alpha beta)-2G^(mu nu):}\begin{equation*} 16 \pi t^{\mu \nu} \equiv H^{\mu \alpha \nu \beta}{ }_{\alpha \beta}-2 G^{\mu \nu} \tag{20.17} \end{equation*}(20.17)16πtμνHμανβαβ2Gμν
(To calculate them in terms of g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν or h μ ν = g μ ν η μ ν h μ ν = g μ ν η μ ν h_(mu nu)=g_(mu nu)-eta_(mu nu)h_{\mu \nu}=g_{\mu \nu}-\eta_{\mu \nu}hμν=gμνημν is straightforward but lengthy. The precise form of these corrections will never be needed in this book.) Then Einstein's equations read
H μ α ν β , α β = 16 π t μ ν + 2 G μ ν = 16 π ( t μ ν + T μ ν ) , H μ α ν β , α β = 16 π t μ ν + 2 G μ ν = 16 π t μ ν + T μ ν , H^(mu alpha nu beta)_(,alpha beta)=16 pit^(mu nu)+2G^(mu nu)=16 pi(t^(mu nu)+T^(mu nu)),H^{\mu \alpha \nu \beta}{ }_{, \alpha \beta}=16 \pi t^{\mu \nu}+2 G^{\mu \nu}=16 \pi\left(t^{\mu \nu}+T^{\mu \nu}\right),Hμανβ,αβ=16πtμν+2Gμν=16π(tμν+Tμν),
so that
(20.18) T eff μ ν = T μ ν + t μ ν . (20.18) T eff  μ ν = T μ ν + t μ ν . {:(20.18)T_("eff ")^(mu nu)=T^(mu nu)+t^(mu nu).:}\begin{equation*} T_{\text {eff }}^{\mu \nu}=T^{\mu \nu}+t^{\mu \nu} . \tag{20.18} \end{equation*}(20.18)Teff μν=Tμν+tμν.
The quantity t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν is sometimes called a "stress-energy pseudotensor for the gravitational field." The Einstein field equations (20.14) imply, because H μ α ν β , α β H μ α ν β , α β H^(mu alpha nu beta)_(,alpha beta)H^{\mu \alpha \nu \beta}{ }_{, \alpha \beta}Hμανβ,αβ is antisymmetric in ν ν nu\nuν and β β beta\betaβ, that
(20.19) T eff , v μ ν = ( T μ ν + t μ ν ) , ν = 0 . (20.19) T eff  , v μ ν = T μ ν + t μ ν , ν = 0 . {:(20.19)T_("eff ",v)^(mu nu)=(T^(mu nu)+t^(mu nu))_(,nu)=0.:}\begin{equation*} T_{\text {eff }, v}^{\mu \nu}=\left(T^{\mu \nu}+t^{\mu \nu}\right)_{, \nu}=0 . \tag{20.19} \end{equation*}(20.19)Teff ,vμν=(Tμν+tμν),ν=0.
These equations are equivalent to T μ ν ; ν = 0 T μ ν ; ν = 0 T^(mu nu)_(;nu)=0T^{\mu \nu}{ }_{; \nu}=0Tμν;ν=0, but they are written with partial derivatives rather than covariant derivatives-a fact that permits conversions back and forth between volume integrals and surface integrals.
All the quantities H μ α ν β , T eff μ ν H μ α ν β , T eff  μ ν H^(mu alpha nu beta),T_("eff ")^(mu nu)H^{\mu \alpha \nu \beta}, T_{\text {eff }}^{\mu \nu}Hμανβ,Teff μν, and t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν depend for their definition and existence on the choice of coordinates; they have no existence independent of coordinates; they are not components of tensors or of any other geometric object. Correspondingly, the equations (20.14) to (20.19) involving T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν and t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν have no geometric, coordinate-free significance; they are not "covariant tensor equations." There is, nevertheless, adequate invariance under general coordinate transformations to give the values P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν of the volume integrals (20.15) and (20.16) geometric, coor-dinate-free significance in the asymptotically flat region far outside the source. Although this invariance is hard to see in the volume integrals themselves, it is clear from the surface-integral forms (20.9) that no coordinate transformation which changes the coordinates only inside some spatially bounded region can influence the values of the integrals. For coordinate changes in the distant, asymptotically flat regions, linearized theory guarantees that under Lorentz transformations the integrals for P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν will transform like special relativistic tensors, and that under infinitesimal coordinate transformations (gauge changes) they will be invariant.
Because t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν are not tensor components, they can vanish at a point in one coordinate system but not in another. The resultant ambiguity in defining a localized energy density t 00 t 00 t^(00)t^{00}t00 for the gravitational field has a counterpart in ambiguities that exist in
t }\mp@subsup{}{}{\mu\nu}\mathrm{ ("'stress-energy
pseudotensor'") defined
T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν defined
Conservation law for T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν
H μ α ν β , t μ ν H μ α ν β , t μ ν H^(mu alpha nu beta),t^(mu nu)H^{\mu \alpha \nu \beta}, t^{\mu \nu}Hμανβ,tμν, and T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν are coordinate-dependent objects
Other, equally good versions of H μ α ν β , t μ ν , T eff μ ν H μ α ν β , t μ ν , T eff  μ ν H^(mu alpha nu beta),t^(mu nu),T_("eff ")^(mu nu)H^{\mu \alpha \nu \beta}, t^{\mu \nu}, T_{\text {eff }}^{\mu \nu}Hμανβ,tμν,Teff μν :
(1) H L L uap β H L L uap β H_(L-L)^(uap beta)H_{\mathrm{L}-\mathrm{L}}^{\operatorname{uap} \beta}HLLuapβ
(2) t L L α β t L L α β t_(L-L)^(alpha beta)t_{\mathrm{L}-\mathrm{L}}^{\alpha \beta}tLLαβ
(3) T L Leff μ ν T L Leff μ ν T_(L-Leff)^(mu nu)T_{\mathrm{L}-\mathrm{Leff}}^{\mu \nu}TLLeffμν
the formal definition of t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν. It is clear that any quantities H new μ α ν β H new  μ α ν β H_("new ")^(mu alpha nu beta)H_{\text {new }}^{\mu \alpha \nu \beta}Hnew μανβ which agree with the original H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ in the asymptotic weak-field region will give the same values as H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ does for the P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν surface integrals (20.9). One especially convenient choice has been given by Landau and Lifshitz (1962; §100), who define
(20.20) H L L μ α ν β = g μ ν g α β g α ν g μ β (20.20) H L L μ α ν β = g μ ν g α β g α ν g μ β {:(20.20)H_(L-L)^(mu alpha nu beta)=g^(mu nu)g^(alpha beta)-g^(alpha nu)g^(mu beta):}\begin{equation*} H_{\mathrm{L}-\mathrm{L}}^{\mu \alpha \nu \beta}=\mathfrak{g}^{\mu \nu} \mathfrak{g}^{\alpha \beta}-\mathfrak{g}^{\alpha \nu} \mathfrak{g}^{\mu \beta} \tag{20.20} \end{equation*}(20.20)HLLμανβ=gμνgαβgανgμβ
where g μ ν ( g ) 1 / 2 g μ ν g μ ν ( g ) 1 / 2 g μ ν g^(mu nu)-=(-g)^(1//2)g^(mu nu)\mathfrak{g}^{\mu \nu} \equiv(-g)^{1 / 2} g^{\mu \nu}gμν(g)1/2gμν. Landau and Lifshitz show that Einstein's equations can be written in the form
(20.21) H L L , α β μ α ν β = 16 π ( g ) ( T μ ν + t L L μ ν ) (20.21) H L L , α β μ α ν β = 16 π ( g ) T μ ν + t L L μ ν {:(20.21)H_(L-L,alpha beta)^(mu alpha nu beta)=16 pi(-g)(T^(mu nu)+t_(L-L)^(mu nu)):}\begin{equation*} H_{\mathrm{L}-\mathrm{L}, \alpha \beta}^{\mu \alpha \nu \beta}=16 \pi(-g)\left(T^{\mu \nu}+t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}\right) \tag{20.21} \end{equation*}(20.21)HLL,αβμανβ=16π(g)(Tμν+tLLμν)
where the Landau-Lifshitz pseudotensor components
( g ) t L L α β = 1 16 π { g α β , λ g λ μ , μ g α λ , , λ g β μ , μ + 1 2 g α β g λ μ g λ ν , ρ g ρ μ , ν ( g α λ g μ ν g β ν , ρ g μ ρ , λ + g β λ g μ ν g , ρ α ν g μ ρ , λ ) + g λ μ g ν ρ g α λ , , α g β μ , ρ (20.22) + 1 8 ( 2 g α λ g β μ g α β g λ μ ) ( 2 g ν ρ g σ τ g ρ σ g ν τ ) g ν τ , λ g ρ σ , μ } ( g ) t L L α β = 1 16 π g α β , λ g λ μ , μ g α λ , , λ g β μ , μ + 1 2 g α β g λ μ g λ ν , ρ g ρ μ , ν g α λ g μ ν g β ν , ρ g μ ρ , λ + g β λ g μ ν g , ρ α ν g μ ρ , λ + g λ μ g ν ρ g α λ , , α g β μ , ρ (20.22) + 1 8 2 g α λ g β μ g α β g λ μ 2 g ν ρ g σ τ g ρ σ g ν τ g ν τ , λ g ρ σ , μ {:[(-g)t_(L-L)^(alpha beta)=(1)/(16 pi){g^(alpha beta)_(,lambda)g^(lambda mu)_(,mu)-g^(alpha lambda),_(,lambda)g^(beta mu)_(,mu)+(1)/(2)g^(alpha beta)g_(lambda mu)g^(lambda nu)_(,rho)g^(rho mu)_(,nu):}],[-(g^(alpha lambda)g_(mu nu)g^(beta nu)_(,rho)g^(mu rho)_(,lambda)+g^(beta lambda)g_(mu nu)g_(,rho)^(alpha nu)g^(mu rho)_(,lambda))+g_(lambda mu)g^(nu rho)g^(alpha lambda)","","^(alpha)g^(beta mu)_(,rho)],[(20.22){:+(1)/(8)(2g^(alpha lambda)g^(beta mu)-g^(alpha beta)g^(lambda mu))(2g_(nu rho)g_(sigma tau)-g_(rho sigma)g_(nu tau))g^(nu tau),lambdag^(rho sigma)_(,mu)}]:}\begin{align*} (-g) t_{\mathrm{L}-\mathrm{L}}^{\alpha \beta}= & \frac{1}{16 \pi}\left\{\mathfrak{g}^{\alpha \beta}{ }_{, \lambda} \mathfrak{g}^{\lambda \mu}{ }_{, \mu}-\mathfrak{g}^{\alpha \lambda},{ }_{, \lambda} \mathfrak{g}^{\beta \mu}{ }_{, \mu}+\frac{1}{2} g^{\alpha \beta} g_{\lambda \mu} \mathfrak{g}^{\lambda \nu}{ }_{, \rho} \mathfrak{g}^{\rho \mu}{ }_{, \nu}\right. \\ & -\left(g^{\alpha \lambda} g_{\mu \nu} \mathfrak{g}^{\beta \nu}{ }_{, \rho} \mathfrak{g}^{\mu \rho}{ }_{, \lambda}+g^{\beta \lambda} g_{\mu \nu} \mathfrak{g}_{, \rho}^{\alpha \nu} \mathfrak{g}^{\mu \rho}{ }_{, \lambda}\right)+g_{\lambda \mu} g^{\nu \rho} \mathfrak{g}^{\alpha \lambda},,{ }^{\alpha} \mathfrak{g}^{\beta \mu}{ }_{, \rho} \\ & \left.+\frac{1}{8}\left(2 g^{\alpha \lambda} g^{\beta \mu}-g^{\alpha \beta} g^{\lambda \mu}\right)\left(2 g_{\nu \rho} g_{\sigma \tau}-g_{\rho \sigma} g_{\nu \tau}\right) \mathfrak{g}^{\nu \tau}, \lambda \mathfrak{g}^{\rho \sigma}{ }_{, \mu}\right\} \tag{20.22} \end{align*}(g)tLLαβ=116π{gαβ,λgλμ,μgαλ,,λgβμ,μ+12gαβgλμgλν,ρgρμ,ν(gαλgμνgβν,ρgμρ,λ+gβλgμνg,ρανgμρ,λ)+gλμgνρgαλ,,αgβμ,ρ(20.22)+18(2gαλgβμgαβgλμ)(2gνρgστgρσgντ)gντ,λgρσ,μ}
are precisely quadratic in the first derivatives of the metric. (Einstein also gave a pseudotensor t E μ ν t E μ ν t_(E)^(mu)_(nu)t_{E}{ }^{\mu}{ }_{\nu}tEμν with this property, but it was not symmetric and so did not lead to an integral for J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν.) Because H L L μ α β H L L μ α β H_(L-L)^(mu alpha beta)H_{\mathrm{L}-\mathrm{L}}^{\mu \alpha \beta}HLLμαβ has the same symmetries as H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ and equals H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ far from the source (exercise 20.4), and because the field equations (20.21) in terms of H L L μ α β H L L μ α β H_(L-L)^(mu alpha beta)H_{\mathrm{L}-\mathrm{L}}^{\mu \alpha \beta}HLLμαβ have the same form as in terms of H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ, it follows that
(20.23a) T L Leff μ ν ( g ) ( T μ ν + t L L μ ν ) (20.23a) T L Leff μ ν ( g ) T μ ν + t L L μ ν {:(20.23a)T_(L-Leff)^(mu nu)-=(-g)(T^(mu nu)+t_(L-L)^(mu nu)):}\begin{equation*} T_{\mathrm{L}-\mathrm{Leff}}^{\mu \nu} \equiv(-g)\left(T^{\mu \nu}+t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}\right) \tag{20.23a} \end{equation*}(20.23a)TLLeffμν(g)(Tμν+tLLμν)
has all the properties of the T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν introduced earlier in this section:
(20.23b) T L Leff , ν μ ν = 0 (20.23c) P μ = T L Leff μ 0 d 3 x (20.23~d) J μ ν = ( x μ T L Leff ν 0 x ν T L Leff μ 0 ) d 3 x (20.23b) T L Leff , ν μ ν = 0 (20.23c) P μ = T L Leff μ 0 d 3 x (20.23~d) J μ ν = x μ T L Leff ν 0 x ν T L Leff μ 0 d 3 x {:[(20.23b)T_(L-Leff,nu)^(mu nu)=0],[(20.23c)P^(mu)=intT_(L-Leff)^(mu0)d^(3)x],[(20.23~d)J^(mu nu)=int(x^(mu)T_(L-Leff)^(nu0)-x^(nu)T_(L-Leff)^(mu0))d^(3)x]:}\begin{gather*} T_{\mathrm{L}-\mathrm{Leff}, \nu}^{\mu \nu}=0 \tag{20.23b}\\ P^{\mu}=\int T_{\mathrm{L}-\mathrm{Leff}}^{\mu 0} d^{3} x \tag{20.23c}\\ J^{\mu \nu}=\int\left(x^{\mu} T_{\mathrm{L}-\mathrm{Leff}}^{\nu 0}-x^{\nu} T_{\mathrm{L}-\mathrm{Leff}}^{\mu 0}\right) d^{3} x \tag{20.23~d} \end{gather*}(20.23b)TLLeff,νμν=0(20.23c)Pμ=TLLeffμ0d3x(20.23~d)Jμν=(xμTLLeffν0xνTLLeffμ0)d3x

EXERCISE

Exercise 20.4. FORM OF H L L ap β H L L ap β H_(L-L)^(ap beta)H_{\mathrm{L}-\mathrm{L}}^{\operatorname{ap} \beta}HLLapβ FAR FROM SOURCE
Show that the entities H L L μ ν β H L L μ ν β H_(L-L)^(mu nu beta)H_{\mathrm{L}-\mathrm{L}}^{\mu \nu \beta}HLLμνβ of equations (20.20) reduce to H μ α ν β H μ α ν β H^(mu alpha nu beta)H^{\mu \alpha \nu \beta}Hμανβ (20.3) in the weak-field region far outside the source.

§20.4. WHY THE ENERGY OF THE GRAVITATIONAL FIELD CANNOT BE LOCALIZED

Consider an element of 3-volume d Σ p d Σ p dSigma_(p)d \Sigma_{p}dΣp and evaluate the contribution of the "gravitational field" in that element of 3 -volume to the energy-momentum 4 -vector, using
in the calculation either the pseudotensor t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν or the pseudotensor t L L μ ν t L L μ ν t_(L-L)^(mu nu)t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}tLLμν discussed in the last section. Thereby obtain
p = e μ t μ ν d Σ ν p = e μ t μ ν d Σ ν p=e_(mu)t^(mu nu)dSigma_(nu)\boldsymbol{p}=\boldsymbol{e}_{\mu} t^{\mu \nu} d \Sigma_{\nu}p=eμtμνdΣν
or
p = e μ μ L L 2 ν d Σ ν p = e μ μ L L 2 ν d Σ ν p=e_(mu)mu_(L-L)^(2nu)dSigma_(nu)\boldsymbol{p}=\boldsymbol{e}_{\mu} \mu_{\mathrm{L}-\mathrm{L}}^{2 \nu} d \Sigma_{\nu}p=eμμLL2νdΣν
Right? No, the question is wrong. The motivation is wrong. The result is wrong. The idea is wrong.
To ask for the amount of electromagnetic energy and momentum in an element of 3-volume makes sense. First, there is one and only one formula for this quantity. Second, and more important, this energy-momentum in principle "has weight." It curves space. It serves as a source term on the righthand side of Einstein's field equations. It produces a relative geodesic deviation of two nearby world lines that pass through the region of space in question. It is observable. Not one of these properties does "local gravitational energy-momentum" possess. There is no unique formula for it, but a multitude of quite distinct formulas. The two cited are only two among an infinity. Moreover, "local gravitational energy-momentum" has no weight. It does not curve space. It does not serve as a source term on the righthand side of Einstein's field equations. It does not produce any relative geodesic deviation of two nearby world lines that pass through the region of space in question. It is not observable.
Anybody who looks for a magic formula for "local gravitational energy-momentum" is looking for the right answer to the wrong question. Unhappily, enormous time and effort were devoted in the past to trying to "answer this question" before investigators realized the futility of the enterprise. Toward the end, above all mathematical arguments, one came to appreciate the quiet but rock-like strength of Einstein's equivalence principle. One can always find in any given locality a frame of reference in which all local "gravitational fields" (all Christoffel symbols; all Γ α μ ν Γ α μ ν Gamma^(alpha)_(mu nu)\Gamma^{\alpha}{ }_{\mu \nu}Γαμν ) disappear. No Γ Γ Gamma\GammaΓ 's means no "gravitational field" and no local gravitational field means no "local gravitational energy-momentum."
Nobody can deny or wants to deny that gravitational forces make a contribution to the mass-energy of a gravitationally interacting system. The mass-energy of the Earth-moon system is less than the mass-energy that the system would have if the two objects were at infinite separation. The mass-energy of a neutron star is less than the mass-energy of the same number of baryons at infinite separation. Surrounding a region of empty space where there is a concentration of gravitational waves, there is a net attraction, betokening a positive net mass-energy in that region of space (see Chapter 35). At issue is not the existence of gravitational energy, but the localizability of gravitational energy. It is not localizable. The equivalence principle forbids.
Look at an old-fashioned potato, replete with warts and bumps. With an orange marking pen, mark on it a "North Pole" and an "equator". The length of the equator is very far from being equal to 2 π 2 π 2pi2 \pi2π times the distance from the North Pole to the
Why one cannot define a localized energy-momentum for the gravitational field
equator. The explanation, "curvature," is simple, just as the explanation, "gravitation", for the deficit in mass of the earth-moon system (or deficit for the neutron star, or surplus for the region of space occupied by the gravitational waves) is simple. Yet it is not possible to ascribe the deficit in the length of the equator in the one case, or in mass in the other case, in any uniquely right way to different elements of the manifold (2-dimensional in the one case, 3-dimensional in the other). Look at a small region on the surface of the potato. The geometry there is locally flat. Look at any small region of space in any of the three gravitating systems. In an appropriate coordinate system it is free of gravitational field. The over-all effect one is looking at is a global effect, not a local effect. That is what the mathematics cries out. That is the lesson of the nonuniqueness of the t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν !

§20.5. CONSERVATION LAWS FOR TOTAL 4-MOMENTUM AND ANGULAR MOMENTUM

Consider a system such as our galaxy or the solar system, which is made up of many gravitating bodies. Some of the bodies may be highly relativistic (black holes; neutron stars), while others are not. However, insist that in the regions between the bodies spacetime be nearly flat (gravity be weak)-so flat, in fact, that one can cover the entire system with coordinates which are (almost) globally inertial, except in a small neighborhood of each body where gravity may be strong. Such coordinates can exist only if the Newtonian gravitational potential, Φ 1 2 ( η 00 g 00 ) Φ 1 2 η 00 g 00 Phi~~(1)/(2)(eta_(00)-g_(00))\Phi \approx \frac{1}{2}\left(\eta_{00}-g_{00}\right)Φ12(η00g00), in the interbody region is small:
ϕ interbody ( Mass of system ) / ( radius of system ) 1 . ϕ interbody  (  Mass of system  ) / (  radius of system  ) 1 . phi_("interbody ")∼(" Mass of system ")//(" radius of system ")≪1.\phi_{\text {interbody }} \sim(\text { Mass of system }) /(\text { radius of system }) \ll 1 .ϕinterbody ( Mass of system )/( radius of system )1.
The solar system certainly satisfies this condition ( Φ interbody 10 7 Φ interbody  10 7 Phi_("interbody ")∼10^(-7)\Phi_{\text {interbody }} \sim 10^{-7}Φinterbody 107 ), as does the Galaxy ( Φ interbody 10 6 Φ interbody  10 6 Phi_("interbody ")∼10^(-6)\Phi_{\text {interbody }} \sim 10^{-6}Φinterbody 106 ), as do clusters of galaxies ( Φ interbody 10 6 Φ interbody  10 6 Phi_("interbody ")∼10^(-6)\Phi_{\text {interbody }} \sim 10^{-6}Φinterbody 106 ); but the universe as a whole does not ( Φ interbody 1 ) Φ interbody  1 {:Phi_("interbody ")∼1)\left.\Phi_{\text {interbody }} \sim 1\right)Φinterbody 1) !
In evaluating volume integrals for the system's total 4-momentum, split its volume into a region containing each body (denoted " A A AAA ") plus an interbody region; and neglect the pseudotensor contribution from the almost-flat interbody region:
(20.24a) P system μ = A A T eff μ 0 d 3 x + interbody region T eff μ 0 d 3 x = A P A μ + interbody region T μ 0 d 3 x . (20.24a) P system  μ = A A T eff  μ 0 d 3 x +  interbody   region  T eff  μ 0 d 3 x = A P A μ +  interbody   region  T μ 0 d 3 x . {:[(20.24a)P_("system ")^(mu)=sum_(A)int_(A)T_("eff ")^(mu0)d^(3)x+int_({:[" interbody "],[" region "]:})T_("eff ")^(mu0)d^(3)x],[=sum_(A)P_(A)^(mu)+int_({:[" interbody "],[" region "]:})T^(mu0)d^(3)x.]:}\begin{align*} P_{\text {system }}^{\mu} & =\sum_{A} \int_{A} T_{\text {eff }}^{\mu 0} d^{3} x+\int_{\substack{\text { interbody } \\ \text { region }}} T_{\text {eff }}^{\mu 0} d^{3} x \tag{20.24a}\\ & =\sum_{A} P_{A}^{\mu}+\int_{\substack{\text { interbody } \\ \text { region }}} T^{\mu 0} d^{3} x . \end{align*}(20.24a)Psystem μ=AATeff μ0d3x+ interbody  region Teff μ0d3x=APAμ+ interbody  region Tμ0d3x.
Because spacetime is asymptotically flat around each body, P A μ P A μ P_(A)^(mu)P_{A}{ }^{\mu}PAμ is the 4-momentum of body A A AAA as measured gravitationally by an experimenter near it. The integral of T μ 0 T μ 0 T^(mu0)T^{\mu 0}Tμ0 over the interbody region is the contribution of any gas, particles, or magnetic
fields out there to the total 4-momentum. A similar breakup of the angular momentum reads
(20.24b) J system μ ν = A J A μ ν + interbody region ( x μ T ν 0 x ν T μ 0 ) d 3 x . (20.24b) J system  μ ν = A J A μ ν +  interbody   region  x μ T ν 0 x ν T μ 0 d 3 x . {:(20.24b)J_("system ")^(mu nu)=sum_(A)J_(A)^(mu nu)+int_({:[" interbody "],[" region "]:})(x^(mu)T^(nu0)-x^(nu)T^(mu0))d^(3)x.:}\begin{equation*} J_{\text {system }}^{\mu \nu}=\sum_{A} J_{A}^{\mu \nu}+\int_{\substack{\text { interbody } \\ \text { region }}}\left(x^{\mu} T^{\nu 0}-x^{\nu} T^{\mu 0}\right) d^{3} x . \tag{20.24b} \end{equation*}(20.24b)Jsystem μν=AJAμν+ interbody  region (xμTν0xνTμ0)d3x.
In operational terms, these breakups show that the total 4-momentum and angular momentum of the system, as measured gravitationally by an experimenter outside it, are sums of P μ P μ P^(mu)P^{\mu}Pμ and J μ v J μ v J^(mu v)J^{\mu v}Jμv for each individual body, as measured gravitationally by an experimenter near it, plus contributions of the usual special-relativistic type from the interbody matter and fields. This is true even if some of the bodies are hurtling through the system with speeds near that of light; their gravitationally measured P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν contribute, on an equal footing with anyone else's, to the system's total P μ P μ P^(mu)P^{\mu}Pμ and J μ ν J μ ν J^(mu nu)J^{\mu \nu}Jμν !
Surround this asymptotically flat system by a two-dimensional surface S S SSS that is at rest in some asymptotic Lorentz frame. Then the 4 -momentum and angular momentum inside S S SSS change at a rate (as measured in S S SSS 's rest frame) given by
d P μ d t = d d t T eff μ 0 d 3 x = T eff , 0 μ 0 d 3 x = T eff , j μ j d 3 x (20.25) = T eff μ j d 2 S j d P μ d t = d d t T eff μ 0 d 3 x = T eff , 0 μ 0 d 3 x = T eff , j μ j d 3 x (20.25) = T eff μ j d 2 S j {:[(dP^(mu))/(dt)=(d)/(dt)intT_(eff)^(mu0)d^(3)x=intT_(eff,0)^(mu0)d^(3)x=-intT_(eff,j)^(mu j)d^(3)x],[(20.25)=-ointT_(eff)^(mu j)d^(2)S_(j)]:}\begin{align*} \frac{d P^{\mu}}{d t} & =\frac{d}{d t} \int T_{\mathrm{eff}}^{\mu 0} d^{3} x=\int T_{\mathrm{eff}, 0}^{\mu 0} d^{3} x=-\int T_{\mathrm{eff}, j}^{\mu j} d^{3} x \\ & =-\oint T_{\mathrm{eff}}^{\mu j} d^{2} S_{j} \tag{20.25} \end{align*}dPμdt=ddtTeffμ0d3x=Teff,0μ0d3x=Teff,jμjd3x(20.25)=Teffμjd2Sj
and similarly
(20.26) d J μ ν d t = S 2 ( x μ T eff v j x ν T eff μ j ) d 2 S j (20.26) d J μ ν d t = S 2 x μ T eff  v j x ν T eff  μ j d 2 S j {:(20.26)(dJ^(mu nu))/(dt)=-oint_(S_(2))(x^(mu)T_("eff ")^(vj)-x^(nu)T_("eff ")^(mu j))d^(2)S_(j):}\begin{equation*} \frac{d J^{\mu \nu}}{d t}=-\oint_{S_{2}}\left(x^{\mu} T_{\text {eff }}^{v j}-x^{\nu} T_{\text {eff }}^{\mu j}\right) d^{2} S_{j} \tag{20.26} \end{equation*}(20.26)dJμνdt=S2(xμTeff vjxνTeff μj)d2Sj
Although the pseudotensor t μ ν t μ ν t^(mu nu)t^{\mu \nu}tμν, in the interbody region and outside the system, contributes negligibly to the total 4 -momentum and angular momentum (by assumption), its contribution via gravitational waves to the time derivatives d P μ / d t d P μ / d t dP^(mu)//dtd P^{\mu} / d tdPμ/dt and d J μ ν / d t d J μ ν / d t dJ^(mu nu)//dtd J^{\mu \nu} / d tdJμν/dt can be important when added up over astronomical periods of time. Thus, one must not ignore it in the flux integrals (20.25), (20.26).
In evaluating these flux integrals, it is especially convenient to use the LandauLifshitz form of T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν, since that form contains no second derivatives of the metric. Thus set
T eff μ ν = ( g ) ( T μ ν + t L L μ ν ) ( T μ ν + t L L μ ν ) T eff  μ ν = ( g ) T μ ν + t L L μ ν T μ ν + t L L μ ν T_("eff ")^(mu nu)=(-g)(T^(mu nu)+t_(L-L)^(mu nu))~~(T^(mu nu)+t_(L-L)^(mu nu))T_{\text {eff }}^{\mu \nu}=(-g)\left(T^{\mu \nu}+t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}\right) \approx\left(T^{\mu \nu}+t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}\right)Teff μν=(g)(Tμν+tLLμν)(Tμν+tLLμν)
where t L L μ ν t L L μ ν t_(L-L)^(mu nu)t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}tLLμν are given by equations (20.22). Only those portions of t L L μ ν t L L μ ν t_(L-L)^(mu nu)t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}tLLμν that die out as 1 / r 2 1 / r 2 1//r^(2)1 / r^{2}1/r2 or 1 / r 3 1 / r 3 1//r^(3)1 / r^{3}1/r3 at large r r rrr can contribute to the flux integrals (20.25), (20.26). For static solutions [ g μ ν g μ ν [g_(mu nu)∼:}\left[\mathfrak{g}_{\mu \nu} \sim\right.[gμν const. + O ( 1 / r ) ] , t L L μ ν + O ( 1 / r ) , t L L μ ν {:+O(1//r)],t_(L-L)^(mu nu)\left.+O(1 / r)\right], t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}+O(1/r)],tLLμν dies out as 1 / r 4 1 / r 4 1//r^(4)1 / r^{4}1/r4. Hence, the only contributions come from dynamic parts of the metric, which, at these large distances, are entirely in the form of gravitational waves. The study of gravitational waves in Chapter 35 will reveal that when t L L μ ν t L L μ ν t_(L-L)^(mu nu)t_{\mathrm{L}-\mathrm{L}}^{\mu \nu}tLLμν is averaged over several wavelengths, it becomes a stress-energy tensor T ( GW ) μ ν T ( GW ) μ ν T^((GW)mu nu)T^{(\mathrm{GW}) \mu \nu}T(GW)μν for the waves, which has all the properties one ever requires of any stress-energy tensor. (For example, via Einstein's equations
Rates of change of total 4-momentum and angular momentum:
(1) expressed as flux integrals of T eff μ ν T eff  μ ν T_("eff ")^(mu nu)T_{\text {eff }}^{\mu \nu}Teff μν
G ( B ) μ ν = 8 π T ( GW ) μ ν G ( B ) μ ν = 8 π T ( GW ) μ ν G^((B)mu nu)=8piT^((GW)mu nu)G^{(\mathrm{B}) \mu \nu}=8 \pi T^{(\mathrm{GW}) \mu \nu}G(B)μν=8πT(GW)μν, it contributes to the "background" curvature of the spacetime through which the waves propagate.) Moreover, averaging t L L μ ν t L L μ ν t_(L-L)^(mu nu)t_{L-L}^{\mu \nu}tLLμν over several wavelengths before evaluating the flux integrals (20.25), (20.26) cannot affect the values of the integrals. Therefore, one can freely make in these integrals the replacement
T eff μ ν = T μ ν + T ( GW ) μ ν , T eff μ ν = T μ ν + T ( GW ) μ ν , T_(eff)^(mu nu)=T^(mu nu)+T^((GW)mu nu),T_{\mathrm{eff}}^{\mu \nu}=T^{\mu \nu}+T^{(\mathrm{GW}) \mu \nu},Teffμν=Tμν+T(GW)μν,
thereby obtaining
(2) expressed as flux integrals of T μ ν + T ( GW ) μ ν T μ ν + T ( GW ) μ ν T^(mu nu)+T^((GW)mu nu)T^{\mu \nu}+T^{(\mathrm{GW}) \mu \nu}Tμν+T(GW)μν
(20.27) d P μ d t = S ( T μ j + T ( GW ) μ j ) d 2 S j (20.28) d J μ ν d t = S [ x μ ( T ν j + T ( GW ) v j ) x ν ( T μ j + T ( GW ) μ j ) ] d 2 S j . (20.27) d P μ d t = S T μ j + T ( GW ) μ j d 2 S j (20.28) d J μ ν d t = S x μ T ν j + T ( GW ) v j x ν T μ j + T ( GW ) μ j d 2 S j . {:[(20.27)-(dP^(mu))/(dt)=oint_(S)(T^(mu j)+T^((GW)mu j))d^(2)S_(j)],[(20.28)-(dJ^(mu nu))/(dt)=oint_(S)[x^(mu)(T^(nu j)+T^((GW)vj))-x^(nu)(T^(mu j)+T^((GW)mu j))]d^(2)S_(j).]:}\begin{gather*} -\frac{d P^{\mu}}{d t}=\oint_{S}\left(T^{\mu j}+T^{(\mathrm{GW}) \mu j}\right) d^{2} S_{j} \tag{20.27}\\ -\frac{d J^{\mu \nu}}{d t}=\oint_{S}\left[x^{\mu}\left(T^{\nu j}+T^{(\mathrm{GW}) v j}\right)-x^{\nu}\left(T^{\mu j}+T^{(\mathrm{GW}) \mu j}\right)\right] d^{2} S_{j} . \tag{20.28} \end{gather*}(20.27)dPμdt=S(Tμj+T(GW)μj)d2Sj(20.28)dJμνdt=S[xμ(Tνj+T(GW)vj)xν(Tμj+T(GW)μj)]d2Sj.
These are tensor equations in the asymptotically flat spacetime surrounding the system. All reference to pseudotensors and other nontensorial entities has disappeared.
Equations (20.27) and (20.28) say that the rate of loss of 4-momentum and angular momentum from the system, as measured gravitationally, is precisely equal to the rate at which matter, fields, and gravitational waves carry off 4-momentum and angular momentum.
This theorem is extremely useful in thought experiments where one imagines changing the 4 -momentum or angular momentum of a highly relativistic body (e.g., a rotating neutron star) by throwing particles onto it from far away [see, e.g., Hartle (1970)].

EXERCISE

Exercise 20.5. TOTAL MASS-ENERGY IN NEWTONIAN LIMIT

(a) Calculate t L L α β t L L α β t_(L-L)^(alpha beta)t_{\mathrm{L}-\mathrm{L}}^{\alpha \beta}tLLαβ for the nearly Newtonian metric
d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) δ j k d x j d x k d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) δ j k d x j d x k ds^(2)=-(1+2Phi)dt^(2)+(1-2Phi)delta_(jk)dx^(j)dx^(k)d s^{2}=-(1+2 \Phi) d t^{2}+(1-2 \Phi) \delta_{j k} d x^{j} d x^{k}ds2=(1+2Φ)dt2+(12Φ)δjkdxjdxk
(see § 18.4 § 18.4 §18.4\S 18.4§18.4 ). Assume the source is slowly changing, so that time derivatives of Φ Φ Phi\PhiΦ can be neglected compared to space derivatives. [Answer:
t L L 00 = 7 8 π Φ , j Φ , j t L L 0 j = 0 (20.29) t L L j k = 1 4 π ( Φ , j Φ , k 1 2 δ j k Φ , Φ , ) ] t L L 00 = 7 8 π Φ , j Φ , j t L L 0 j = 0 (20.29) t L L j k = 1 4 π Φ , j Φ , k 1 2 δ j k Φ , Φ , {:[t_(L-L)^(00)=-(7)/(8pi)Phi_(,j)Phi_(,j)],[t_(L-L)^(0j)=0],[(20.29){:t_(L-L)^(jk)=(1)/(4pi)(Phi_(,j)Phi_(,k)-(1)/(2)delta_(jk)Phi_(,ℓ)Phi_(,ℓ))*]]:}\begin{align*} & t_{\mathrm{L}-\mathrm{L}}^{00}=-\frac{7}{8 \pi} \Phi_{, j} \Phi_{, j} \\ & t_{\mathrm{L}-\mathrm{L}}^{0 j}=0 \\ & \left.t_{\mathrm{L}-\mathrm{L}}^{j k}=\frac{1}{4 \pi}\left(\Phi_{, j} \Phi_{, k}-\frac{1}{2} \delta_{j k} \Phi_{, \ell} \Phi_{, \ell}\right) \cdot\right] \tag{20.29} \end{align*}tLL00=78πΦ,jΦ,jtLL0j=0(20.29)tLLjk=14π(Φ,jΦ,k12δjkΦ,Φ,)]
(Note: t L L i k t L L i k t_(L-L)^(ik)t_{\mathrm{L}-\mathrm{L}}^{i k}tLLik as given here is the "stress tensor for a Newtonian gravitational field"; cf. exercises 39.5 and 39.6.)
(b) Let the source of the gravitational field be a perfect fluid with
T μ ν = ( ρ + p ) u μ u ν + p g μ ν , p / ρ v 2 ( d x / d t ) 2 | Φ | . T μ ν = ( ρ + p ) u μ u ν + p g μ ν , p / ρ v 2 ( d x / d t ) 2 | Φ | . T^(mu nu)=(rho+p)u^(mu)u^(nu)+pg^(mu nu),quad p//rho∼v^(2)-=(dx//dt)^(2)∼|Phi|.T^{\mu \nu}=(\rho+p) u^{\mu} u^{\nu}+p g^{\mu \nu}, \quad p / \rho \sim v^{2} \equiv(d x / d t)^{2} \sim|\Phi| .Tμν=(ρ+p)uμuν+pgμν,p/ρv2(dx/dt)2|Φ|.
Let the Newtonian potential satisfy the source equation
Φ , j j = 4 π ρ Φ , j j = 4 π ρ Phi_(,jj)=4pi rho\Phi_{, j j}=4 \pi \rhoΦ,jj=4πρ
Show that the energy of the source is
P 0 = ( T 00 + t 00 ) ( g ) d 3 x (20.30) = [ ρ / ( ( 1 v 2 ) 1 / 2 + 1 2 ρ v 2 + 1 2 ρ Φ ] ( g x x g v y g z z ) 1 / 2 d x d y d z [ Lorentz contraction factor ] [ kinetic energy ] ↑↑ [ potential energy ] [ proper volume ] + higher-order corrections. P 0 = T 00 + t 00 ( g ) d 3 x (20.30) = [ ρ / ( 1 v 2 1 / 2 + 1 2 ρ v 2 + 1 2 ρ Φ g x x g v y g z z 1 / 2 d x d y d z  Lorentz   contraction   factor   kinetic   energy  ↑↑  potential   energy   proper   volume  +  higher-order corrections.  {:[P^(0)=int(T^(00)+t^(00))(-g)d^(3)x],[(20.30)=int[rho//(ubrace((1-v^(2))^(1//2)ubrace)+ubrace((1)/(2)rhov^(2)ubrace)+ubrace((1)/(2)rho Phi]ubrace)ubrace((g_(xx)g_(vy)g_(zz))^(1//2)dxdydzubrace)],[[[" Lorentz "],[" contraction "],[" factor "]]uarr[[" kinetic "],[" energy "]]uarr uarr[[" potential "],[" energy "]]uarr[[" proper "],[" volume "]]],[+" higher-order corrections. "]:}\begin{align*} & P^{0}=\int\left(T^{00}+t^{00}\right)(-g) d^{3} x \\ & =\int[\rho /(\underbrace{\left(1-v^{2}\right)^{1 / 2}}+\underbrace{\frac{1}{2} \rho v^{2}}+\underbrace{\left.\frac{1}{2} \rho \Phi\right]} \underbrace{\left(g_{x x} g_{v y} g_{z z}\right)^{1 / 2} d x d y d z} \tag{20.30}\\ & {\left[\begin{array}{l} \text { Lorentz } \\ \text { contraction } \\ \text { factor } \end{array}\right] \uparrow\left[\begin{array}{l} \text { kinetic } \\ \text { energy } \end{array}\right] \uparrow \uparrow\left[\begin{array}{l} \text { potential } \\ \text { energy } \end{array}\right] \uparrow\left[\begin{array}{l} \text { proper } \\ \text { volume } \end{array}\right]} \\ & + \text { higher-order corrections. } \end{align*}P0=(T00+t00)(g)d3x(20.30)=[ρ/((1v2)1/2+12ρv2+12ρΦ](gxxgvygzz)1/2dxdydz[ Lorentz  contraction  factor ][ kinetic  energy ]↑↑[ potential  energy ][ proper  volume ]+ higher-order corrections. 
(c) Show that the "equations of motion" T L Leff , v μ ν = 0 T L Leff , v μ ν = 0 T_(L-Leff,v)^(mu nu)=0T_{\mathrm{L}-\mathrm{Leff}, v}^{\mu \nu}=0TLLeff,vμν=0 reduce to the standard equations (16.3) of Newtonian hydrodynamics.

§20.6. EQUATIONS OF MOTION DERIVED FROM THE FIELD EQUATION

Consider the Einstein field equation
(20.31) G = 8 π T (20.31) G = 8 π T {:(20.31)G=8pi T:}\begin{equation*} \boldsymbol{G}=8 \pi \boldsymbol{T} \tag{20.31} \end{equation*}(20.31)G=8πT
under conditions where space is empty of everything except a source-free electromagnetic field:
(20.32) T μ ν = 1 4 π ( F μ α g α β F ν β 1 4 g μ ν F σ τ F σ τ ) (20.32) T μ ν = 1 4 π F μ α g α β F ν β 1 4 g μ ν F σ τ F σ τ {:(20.32)T^(mu nu)=(1)/(4pi)(F^(mu alpha)g_(alpha beta)F^(nu beta)-(1)/(4)g^(mu nu)F_(sigma tau)F^(sigma tau)):}\begin{equation*} T^{\mu \nu}=\frac{1}{4 \pi}\left(F^{\mu \alpha} g_{\alpha \beta} F^{\nu \beta}-\frac{1}{4} g^{\mu \nu} F_{\sigma \tau} F^{\sigma \tau}\right) \tag{20.32} \end{equation*}(20.32)Tμν=14π(FμαgαβFνβ14gμνFστFστ)
(cf. the expression for stress-energy tensor of the electromagnetic field in §5.6). To predict from (20.31) how the geometry changes with time, one has to know how the electromagnetic field changes with time. The field is expressed as the "exterior derivative" of the 4-potential,
F = d A (language of forms) F = d A  (language of forms)  F=dA" (language of forms) "\boldsymbol{F}=\boldsymbol{d} \boldsymbol{A} \text { (language of forms) }F=dA (language of forms) 
or
(20.33) F μ ν = A ν x μ A μ x ν (language of components), (20.33) F μ ν = A ν x μ A μ x ν  (language of components),  {:(20.33)F_(mu nu)=(delA_(nu))/(delx^(mu))-(delA_(mu))/(delx^(nu))" (language of components), ":}\begin{equation*} F_{\mu \nu}=\frac{\partial A_{\nu}}{\partial x^{\mu}}-\frac{\partial A_{\mu}}{\partial x^{\nu}} \text { (language of components), } \tag{20.33} \end{equation*}(20.33)Fμν=AνxμAμxν (language of components), 
and the time rate of change of the field is governed by the Maxwell equation
d F = 0 d F = 0 d^(**)F=0\boldsymbol{d}^{*} \boldsymbol{F}=0dF=0
or
(20.34) F ; ν μ ν = 0 (20.34) F ; ν μ ν = 0 {:(20.34)F_(;nu)^(mu nu)=0:}\begin{equation*} F_{; \nu}^{\mu \nu}=0 \tag{20.34} \end{equation*}(20.34)F;νμν=0
Vacuum Maxwell equations derived from Einstein field equation
If it seems a fair division of labor for the Maxwell equation to predict the development in time of the Maxwell field and the Einstein equation to do the same for the Einstein field, then it may come as a fresh surprise to discover that the Einstein equation (20.31), plus expression (20.32) for the Maxwell stress-energy, can do both jobs. One does not have to be given the Maxwell "equation of motion" (20.34). One can derive it fresh from (20.31) plus (20.32). The proof proceeds in five steps (see also exercise 3.18 and § 5.10 § 5.10 §5.10\S 5.10§5.10 ). Step one: The Bianchi identity G 0 G 0 grad*G-=0\boldsymbol{\nabla} \cdot \boldsymbol{G} \equiv 0G0 implies conservation of energy-momentum T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0. Step two: Conservation expresses itself in the language of components in the form
0 = 8 π T ; ν μ ν = 2 F ; ν μ α g α β F ν β + 2 F μ α g α β F ; ν ν β ; ν (20.35) g μ ν F σ τ ; ν F σ τ . 0 = 8 π T ; ν μ ν = 2 F ; ν μ α g α β F ν β + 2 F μ α g α β F ; ν ν β ; ν (20.35) g μ ν F σ τ ; ν F σ τ . {:[0=8piT_(;nu)^(mu nu)=2F_(;nu)^(mu alpha)g_(alpha beta)F^(nu beta)+2F^(mu alpha)g_(alpha beta)F_(;nu)^(nu beta)_(;nu)],[(20.35)-g^(mu nu)F_(sigma tau;nu)F^(sigma tau).]:}\begin{align*} 0=8 \pi T_{; \nu}^{\mu \nu}= & 2 F_{; \nu}^{\mu \alpha} g_{\alpha \beta} F^{\nu \beta}+2 F^{\mu \alpha} g_{\alpha \beta} F_{; \nu}^{\nu \beta}{ }_{; \nu} \\ & -g^{\mu \nu} F_{\sigma \tau ; \nu} F^{\sigma \tau} . \tag{20.35} \end{align*}0=8πT;νμν=2F;νμαgαβFνβ+2FμαgαβF;ννβ;ν(20.35)gμνFστ;νFστ.
Step three: Leaving the middle term unchanged, rearrange the first term so that, like the last term, it carries a factor F σ τ F σ τ F^(sigma tau)F^{\sigma \tau}Fστ. Thus in that first term let the indices ν β ν β nu beta\nu \betaνβ of F ν β F ν β F^(nu beta)F^{\nu \beta}Fνβ be replaced in turn by σ τ σ τ sigma tau\sigma \tauστ and by τ σ τ σ tau sigma\tau \sigmaτσ, to subdivide that term into
F ; σ σ μ α g α τ F σ τ + F μ α ; τ g α σ F τ σ = ( F τ ; σ μ F σ ; τ μ ) F σ τ (20.36) = g μ ν ( F ν τ ; σ + F σ ν ; τ ) F σ τ F ; σ σ μ α g α τ F σ τ + F μ α ; τ g α σ F τ σ = F τ ; σ μ F σ ; τ μ F σ τ (20.36) = g μ ν F ν τ ; σ + F σ ν ; τ F σ τ {:[F_(;sigma sigma)^(mu alpha)g_(alpha tau)F^(sigma tau)+F^(mu alpha)_(;tau)g_(alpha sigma)F^(tau sigma)],[quad=(F_(tau;sigma)^(mu)-F_(sigma;tau)^(mu))F^(sigma tau)],[(20.36)quad=g^(mu nu)(F_(nu tau;sigma)+F_(sigma nu;tau))F^(sigma tau)]:}\begin{align*} & F_{; \sigma \sigma}^{\mu \alpha} g_{\alpha \tau} F^{\sigma \tau}+F^{\mu \alpha}{ }_{; \tau} g_{\alpha \sigma} F^{\tau \sigma} \\ & \quad=\left(F_{\tau ; \sigma}^{\mu}-F_{\sigma ; \tau}^{\mu}\right) F^{\sigma \tau} \\ & \quad=g^{\mu \nu}\left(F_{\nu \tau ; \sigma}+F_{\sigma \nu ; \tau}\right) F^{\sigma \tau} \tag{20.36} \end{align*}F;σσμαgατFστ+Fμα;τgασFτσ=(Fτ;σμFσ;τμ)Fστ(20.36)=gμν(Fντ;σ+Fσν;τ)Fστ
Step four: Combine the first and the last terms in (20.35) to give
(20.37) g μ ν ( F ν τ ; σ + F σ ν ; τ + F τ σ ; ν ) F σ τ (20.37) g μ ν F ν τ ; σ + F σ ν ; τ + F τ σ ; ν F σ τ {:(20.37)g^(mu nu)(F_(nu tau;sigma)+F_(sigma nu;tau)+F_(tau sigma;nu))F^(sigma tau):}\begin{equation*} g^{\mu \nu}\left(F_{\nu \tau ; \sigma}+F_{\sigma \nu ; \tau}+F_{\tau \sigma ; \nu}\right) F^{\sigma \tau} \tag{20.37} \end{equation*}(20.37)gμν(Fντ;σ+Fσν;τ+Fτσ;ν)Fστ
The indices on the derivatives of the field quantities stand in cyclic order. This circumstance annuls all the terms in the connection coefficients Γ α β γ Γ α β γ Gamma^(alpha)_(beta gamma)\Gamma^{\alpha}{ }_{\beta \gamma}Γαβγ when one writes out the covariant derivatives explicitly. Thus one can replace the covariant derivatives by ordinary derivatives. Moreover, these three derivatives annul one another identically when one substitutes for the fields their expressions (20.33) in terms of the potentials. Consequently, nothing remains in the conservation law (20.35) except the middle term, giving rise to four statements ( μ = 0 , 1 , 2 , 3 ) ( μ = 0 , 1 , 2 , 3 ) (mu=0,1,2,3)(\mu=0,1,2,3)(μ=0,1,2,3)
(20.38) F β μ F β v ; v = 0 (20.38) F β μ F β v ; v = 0 {:(20.38)F_(beta)^(mu)F^(beta v)_(;v)=0:}\begin{equation*} F_{\beta}^{\mu} F^{\beta v}{ }_{; v}=0 \tag{20.38} \end{equation*}(20.38)FβμFβv;v=0
about the four quantities ( β = 0 , 1 , 2 , 3 ) ( β = 0 , 1 , 2 , 3 ) (beta=0,1,2,3)(\beta=0,1,2,3)(β=0,1,2,3)
(20.39) F ; v β v (20.39) F ; v β v {:(20.39)F_(;v)^(beta v):}\begin{equation*} F_{; v}^{\beta v} \tag{20.39} \end{equation*}(20.39)F;vβv
Step five: The determinant of the coefficients in the four equations (20.38) for the four unknowns (20.39) has the value
(20.40) | F 0 0 F 0 1 F 2 0 F 3 0 F 3 0 F 3 1 F 3 2 F 3 3 | = ( E B ) 2 (20.40) F 0 0 F 0 1 F 2 0 F 3 0 F 3 0 F 3 1 F 3 2 F 3 3 = ( E B ) 2 {:(20.40)|[F_(0)^(0)F^(0)_(1)F_(2)^(0)F_(3)^(0)],[cdots cdots cdots cdots cdots],[cdots cdots cdots cdots cdots],[F^(3)_(0)F^(3)_(1)F^(3)_(2)F^(3)_(3)]|=-(E*B)^(2):}\left|\begin{array}{l} F_{0}^{0} F^{0}{ }_{1} F_{2}^{0} F_{3}^{0} \tag{20.40}\\ \cdots \cdots \cdots \cdots \cdots \\ \cdots \cdots \cdots \cdots \cdots \\ F^{3}{ }_{0} F^{3}{ }_{1} F^{3}{ }_{2} F^{3}{ }_{3} \end{array}\right|=-(\boldsymbol{E} \cdot \boldsymbol{B})^{2}(20.40)|F00F01F20F30F30F31F32F33|=(EB)2
(see exercise 20.6, part i). In the generic case, this one function of the four variables ( t , x , y , z t , x , y , z t,x,y,zt, x, y, zt,x,y,z ) vanishes on one or more hypersurfaces; but off any such hypersurface (i.e., at "normal points" in spacetime) it differs from zero. At all normal points, the solution of the four linear equations (20.38) with their nonvanishing determinant gives identically zero for the four unknowns (20.39); that is to say, Maxwell's "equations of motion"
F β ν ; ν = 0 F β ν ; ν = 0 F^(beta nu)_(;nu)=0F^{\beta \nu}{ }_{; \nu}=0Fβν;ν=0
are fulfilled and must be fulfilled as a straight consequence of Einstein's field equation (20.31)-plus expression 20.32 for the stress-energy tensor. Special cases admit counterexamples (see exercise 20.8); but in the generic case one need not invoke Maxwell's equations of motion; one can deduce them from the Einstein field equation.
Turn from the dynamics of the Maxwell field itself to the dynamics of a charged particle moving under the influence of the Maxwell field. Make no more appeal to outside providence for the Lorentz equation of motion than for the Maxwell equation of motion. Instead, to generate the Lorentz equation call once more on the Einstein field equation or, more directly, on its consequence, the principle of the local conservation of energy-momentum.
Keep track of the world line of the particle from t = t t = t t=tt=tt=t to t = t + Δ t t = t + Δ t t=t+Delta tt=t+\Delta tt=t+Δt (Figure 20.1). Generate a "world tube" around this world line. Thus, at each value of the time coordinate t t ttt, take the location of the particle as center; construct a sphere of radius ϵ ϵ epsilon\epsilonϵ around this center; and note how the successive spheres sweep out the desired world tube. Construct "caps" on this tube at times t t ttt and t + Δ t t + Δ t t+Delta tt+\Delta tt+Δt. The two caps, together with the world tube proper, bound a region of spacetime in which energy and momentum can be neither created nor destroyed ("no creation of moment of rotation," in the language of the Bianchi identities, Chapter 15). Therefore the energy-momentum emerging out of the "top" cap has to equal the energy-momentum entering the "bottom" cap, supplemented by the amount of energy-momentum carried in across the world-tube by the Maxwell field. Out of such an analysis, as performed in flat spacetime, one ends up with the Lorentz equation of motion in its elementary form (see Chapters 3 and 4),
Figure 20.1.
"World tube." The change in the 4 -momentum of the particle is governed by the flow of 4 -momentum across the boundary of the world tube.
Lorentz force equation derived from the Einstein field equation
d p / d τ = e F , u (language of forms) d p / d τ = e F , u  (language of forms)  dp//d tau=e(:F,u:)quad" (language of forms) "d \boldsymbol{p} / d \tau=e\langle\boldsymbol{F}, \boldsymbol{u}\rangle \quad \text { (language of forms) }dp/dτ=eF,u (language of forms) 
or in curved spacetime, the Lorentz equation of motion in covariant form,
u p = m u u = e F , u (form language) u p = m u u = e F , u  (form language)  grad_(u)p=mgrad_(u)u=e(:F,u:)quad" (form language) "\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{p}=m \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=e\langle\boldsymbol{F}, \boldsymbol{u}\rangle \quad \text { (form language) }up=muu=eF,u (form language) 
or
(20.41) m [ d 2 x α d τ 2 + Γ α μ ν d x μ d τ d x ν d τ ] = e F α β d x β d τ (component language). (20.41) m d 2 x α d τ 2 + Γ α μ ν d x μ d τ d x ν d τ = e F α β d x β d τ  (component language).  {:(20.41)m[(d^(2)x^(alpha))/(dtau^(2))+Gamma^(alpha)_(mu nu)(dx^(mu))/(d tau)(dx^(nu))/(d tau)]=eF^(alpha)_(beta)(dx^(beta))/(d tau)" (component language). ":}\begin{equation*} m\left[\frac{d^{2} x^{\alpha}}{d \tau^{2}}+\Gamma^{\alpha}{ }_{\mu \nu} \frac{d x^{\mu}}{d \tau} \frac{d x^{\nu}}{d \tau}\right]=e F^{\alpha}{ }_{\beta} \frac{d x^{\beta}}{d \tau} \text { (component language). } \tag{20.41} \end{equation*}(20.41)m[d2xαdτ2+Γαμνdxμdτdxνdτ]=eFαβdxβdτ (component language). 
"One ends up with the Lorentz equation of motion"-but only after hurdling problems of principle along the way. One would understand what a particle is if one understood how to do the calculation of balance of energy-momentum with all rigor! Few calculations in all of physics have been done in so many ways by so many leading investigators, from Lorentz and his predecessors to Dirac and Rohrlich [see Teitelboim ( 1970 , 1971 ) ( 1970 , 1971 ) (1970,1971)(1970,1971)(1970,1971) for still further insights]. Among the issues that develop are two that never cease to compel attention. (1) The particle responds according to the Lorentz force law (20.41) to a field. This field is the sum of a contribution from external sources and from the particle itself. How is the field exerted by the particle on itself to be calculated? Insofar as it is not already included in its effects in the "experimental mass" m m mmm in (20.41), this force is to be calculated as half the difference between the retarded field and the advanced field caused by that particle (see § 36.11 § 36.11 §36.11\S 36.11§36.11 for a more detailed discussion of the corresponding point for an emitter of gravitational radiation). This difference is singularity-free. On the world line, it has the following simple value [valid in general for point particles; valid for finite-sized particles when and only when the particle changes its velocity negligibly compared to the speed of light during the light-travel time across itselfsee, e.g., Burke (1970)]
(20.42) 1 2 ( F ret F adv ) μ ν = 2 e 3 ( d x μ d τ d 3 x ν d τ 3 d 3 x μ d τ 3 d x ν d τ ) (20.42) 1 2 F ret F adv μ ν = 2 e 3 d x μ d τ d 3 x ν d τ 3 d 3 x μ d τ 3 d x ν d τ {:(20.42)(1)/(2)(F_(ret)-F_(adv))^(mu nu)=(2e)/(3)((dx^(mu))/(d tau)(d^(3)x^(nu))/(dtau^(3))-(d^(3)x^(mu))/(dtau^(3))(dx^(nu))/(d tau)):}\begin{equation*} \frac{1}{2}\left(F_{\mathrm{ret}}-F_{\mathrm{adv}}\right)^{\mu \nu}=\frac{2 e}{3}\left(\frac{d x^{\mu}}{d \tau} \frac{d^{3} x^{\nu}}{d \tau^{3}}-\frac{d^{3} x^{\mu}}{d \tau^{3}} \frac{d x^{\nu}}{d \tau}\right) \tag{20.42} \end{equation*}(20.42)12(FretFadv)μν=2e3(dxμdτd3xνdτ3d3xμdτ3dxνdτ)
Every acceptable line of reasoning has always led to expression (20.42). It also represents the field required to reproduce the long-known and thoroughly tested law of radiation damping. (2) "Infinite self-energy." Around a particle at rest, or close to a particle in an arbitrary state of motion, the field is e / r 2 e / r 2 e//r^(2)e / r^{2}e/r2 and the field energy is
(20.43) ( 1 / 8 π ) r min ϵ ( e / r 2 ) 2 4 π r 2 d r = ( e 2 / 2 ) ( r min 1 ϵ 1 ) (20.43) ( 1 / 8 π ) r min ϵ e / r 2 2 4 π r 2 d r = e 2 / 2 r min 1 ϵ 1 {:(20.43)(1//8pi)int_(r_(min))^(epsilon)(e//r^(2))^(2)4pir^(2)dr=(e^(2)//2)(r_(min)^(-1)-epsilon^(-1)):}\begin{equation*} (1 / 8 \pi) \int_{r_{\min }}^{\epsilon}\left(e / r^{2}\right)^{2} 4 \pi r^{2} d r=\left(e^{2} / 2\right)\left(r_{\min }^{-1}-\epsilon^{-1}\right) \tag{20.43} \end{equation*}(20.43)(1/8π)rminϵ(e/r2)24πr2dr=(e2/2)(rmin1ϵ1)
This expression diverges as r min r min r_(min)r_{\min }rmin is allowed to go to zero. To hurdle this difficulty, one arranges the calculation of energy balance in such a way that there always appears the sum of this "self-energy" and the "bare mass." The two terms individually are envisaged as "going to infinity" as r min r min r_(min)r_{\min }rmin goes to zero; but the sum is identified with the "experimental mass" and is required to remain finite. Of course, no particle is a classical object. A proper calculation of the energy has to be conducted at the quantum level. There it is easier to hide from sight the separate infinities-but they
are still present, and promise to remain until the structure of a particle is understood.
Before one turns from the Maxwell and Lorentz equations of motion to a final example (deriving the geodesic equations of motion for an uncharged particle), is it not time to object to the whole program of "deriving an equation of motion from Einstein's field equation"? First, is it not a pretensious parade of pomposity to say it comes "from Einstein's field equation" (and even more, "from Einstein's field equations") when it really comes from a principle so elementary and long established as the law of conservation of 4-momentum? It cannot be contested that this conservation principle, in historical fact, came before geometrodynamics, just as it came before electrodynamics and before the theories of all other established fields. However, in no theory but Einstein's is this principle incorporated as an identity. Only here does the conservation of energy-momentum appear as a fully automatic consequence of the inner working of the machinery of the world (energy density tied to moment of rotation, and moment of rotation automatically conserved; see Chapter 17). Out of Einstein's theory one can derive the equation of motion of a particle. Out of Maxwell's one cannot. Thus, nothing prevents one from acting on a charge with an "external" force, over and above the Lorentz force, nor from tailoring this force in such a way that the charge follows some prescribed world line ("enginedriven source"). It makes no difficulties whatsoever for Maxwell's equations that one has shifted attention from a world line that follows the Lorentz equation of motion to one that does not. Quite the contrary is true in general relativity. To shift from right world line (geodesic) to wrong world line makes the difference between satisfying Einstein's field equation in the vicinity of that world line and being unable to satisfy Einstein's field equation.
The Maxwell field equations are so constructed that they automatically fulfill and demand the conservation of charge; but not everything has charge. The Einstein field equation is so constructed that it automatically fulfills and demands the conservation of momentum-energy; and everything does have energy. The Maxwell field equations are indifferent to the interposition of an "external" force, because that force in no way threatens the principle of conservation of charge. The Einstein field equation cares about every force, because every force is a medium for the exchange of energy.
Electromagnetism has the motto, "I count all the electric charge that's here." All that bears no charge escapes its gaze.
"I weigh all that's here" is the motto of spacetime curvature. No physical entity escapes this surveillance.
Why, then, is the derivation of the geodesic equation of motion of an object said to be based on "Einstein's geometrodynamic field equation" rather than on "the principle of conservation of 4 -momentum"? Because geometry responds by its curvature to mass-energy in every form. Most of all, because geometry outside tells about mass-energy inside, free of all concern about issues of internal structure (violent motions, unknown forces, tortuously curved and even multiply-connected geometry).
If one objection to the plan to derive the equation of motion of a particle "from the field equation" has been disposed of, then the moment has come to deal with
Why one is justified to regard equations of motion as consequences of the Einstein field equation
How one can avoid complexities of particle structure when deriving equations of motion: the "external viewpoint"

Derivation of geodesic

motion from Einstein field equation:
(1) derivation in brief
(2) derivation with care
Coupling of curvature to particle moments produces deviations from geodesic motion
the other natural objection: Is there not an inner contradiction in trying to apply to a "particle" (implying idealization to a point) a field equation that deals with the continuum? Answer: There is a contradiction in dealing with a point. Therefore do not deal with a point. Do not deal with internal structure at all. Analyze the motion by looking at the geometry outside the object. That geometry provides all the handle one needs to follow the motion.
Already here one sees the difference from the derivation of the Lorentz equation of motion as sketched out above. There (1) no advantage was taken of geometry outside as indicator of motion inside; (2) a detailed bookkeeping was envisaged of the localization in space of the electromagnetic energy; and (3) this bookkeeping brought up the issue of the internal structure of the particle, which could not be satisfactorily resolved.
Now begin the analysis in the new geometrodynamic spirit. Surrounding "the Schwarzschild zone of influence" of the object, mark out a "buffer zone" (Figure 20.2) that extends out to the region where the "background geometry" begins to depart substantially from flatness. Idealize the geometry in the buffer zone as that of an unchanging source merging asymptotically ("boundary B B B\mathscr{B}B of buffer zone") into flat space. It suffices to recall the properties of the spacetime geometry far outside an unchanging (i.e., nonradiating) source (exercise 19.3) to draw the key conclusion: relative to this flat spacetime and regardless of its internal structure, the object remains at rest, or continues to move in a straight line at uniform velocity (conservation of total 4 -momentum; § 20.5 § 20.5 §20.5\S 20.5§20.5 ). In other words, it obeys the geodesic equation of motion. If this is the result in a flash, then it is appropriate to go back a step to review it, to find out what it means and what it demands.
When the object is absent and the background geometry alone has to be considered, then the geodesic is a well-defined mathematical construct. Moreover, FermiWalker transport along this geodesic gives a well-defined way to construct a comoving local inertial frame (see §13.6). Relative to this frame, the representative point of the geodesic remains for all time at rest at the origin.
In what way does the presence of the object change this picture? The object possesses an angular momentum, mass quadrupole moments, and higher multipole moments. They interact with the tide-producing accelerations (Riemann curvature) of the background geometry. Depending on the orientation in space of these moments, the interactions drive the object off its geodesic course in one direction or another (see §40.9). These anomalies in the motion go hand in hand with anomalies in the geometry. On and near the ideal mathematical geodesic the metric is Minkowskian. At a point removed from this geodesic by a displacement with Riemann normal coordinates ξ 1 , ξ 2 , ξ 3 ξ 1 , ξ 2 , ξ 3 xi^(1),xi^(2),xi^(3)\xi^{1}, \xi^{2}, \xi^{3}ξ1,ξ2,ξ3 (see §11.6), the metric components differ from their canonical values ( 1 , 1 , 1 , 1 ) ( 1 , 1 , 1 , 1 ) (-1,1,1,1)(-1,1,1,1)(1,1,1,1) by amounts proportional (1) to the squares and products of the ξ m ξ m xi^(m)\xi^{m}ξm and (2) to the components of the Riemann curvature tensor (tide-producing acceleration) of the background geometry. These second-order terms produce departures from ideality in the buffer zone, departures that may be described symbolically as of order
(20.44) δ ( metric ) r 2 R ( spherical harmonic of order two ) (20.44) δ (  metric  ) r 2 R (  spherical harmonic of order two  ) {:(20.44)delta(" metric ")∼r^(2)*R*(" spherical harmonic of order two "):}\begin{equation*} \delta(\text { metric }) \sim r^{2} \cdot R \cdot(\text { spherical harmonic of order two }) \tag{20.44} \end{equation*}(20.44)δ( metric )r2R( spherical harmonic of order two )
Figure 20.2.
"Buffer zone": the shell of space between a a aaa and B B B\mathscr{B}B, where the geometry is appropriately idealized as the spherically symmetric "Schwarzschild geometry" of a localized center of attraction (the object under study) in an asymptotically flat space. Inside a a aaa : the "zone of influence" of the object. In the general case where this object lacks all symmetry, the metric is found to depart more and more from ideal "Schwarzschild character" as the exploration of the geometry is carried inward from Q Q Q\mathscr{Q}Q (effect of angular momentum of the object on the metric; effect of quadrupole moment; effect of higher moments). Outside B B B\mathscr{B}B : the "background geometry." As this geometry is explored at greater and greater distances outside B B B\mathscr{B}B, it is found to depart more and more from flatness (effect of concentrations of mass, gravitational waves, and other geometrodynamics).
Here r r rrr is the distance from the geodesic and R R RRR is the magnitude of the significant components of the curvature tensor. The object produces not only the standard "Schwarzschild" departure from flatness,
(20.45) δ ( metric ) m / r , (20.45) δ (  metric  ) m / r , {:(20.45)delta(" metric ")∼m//r",":}\begin{equation*} \delta(\text { metric }) \sim m / r, \tag{20.45} \end{equation*}(20.45)δ( metric )m/r,
which by itself (in a flat background) would bring about no departure from geodesic motion, but also correction terms which may be symbolized as
(20.46) δ ( metric ) ( S / r 2 ) ( spherical harmonic of order one ) (20.46) δ (  metric  ) S / r 2 (  spherical harmonic of order one  ) {:(20.46)delta(" metric ")∼(S//r^(2))(" spherical harmonic of order one "):}\begin{equation*} \delta(\text { metric }) \sim\left(S / r^{2}\right)(\text { spherical harmonic of order one }) \tag{20.46} \end{equation*}(20.46)δ( metric )(S/r2)( spherical harmonic of order one )
and
(20.47) δ ( metric ) ( t / r 3 ) ( spherical harmonic of order two ) (20.47) δ (  metric  ) t / r 3 (  spherical harmonic of order two  ) {:(20.47)delta(" metric ")∼(t//r^(3))(" spherical harmonic of order two "):}\begin{equation*} \delta(\text { metric }) \sim\left(t / r^{3}\right)(\text { spherical harmonic of order two }) \tag{20.47} \end{equation*}(20.47)δ( metric )(t/r3)( spherical harmonic of order two )
and higher-order terms. Here S ( cm 2 ) S cm 2 S(cm^(2))S\left(\mathrm{~cm}^{2}\right)S( cm2) is a typical component of the angular momentum vector or "spin"; t ( cm 3 ) t cm 3 t(cm^(3))t\left(\mathrm{~cm}^{3}\right)t( cm3) is a representative component of the moment of inertia or quadrupole tensor (see Chapter 36 for details), and higher terms have higher-order coefficients.
The tide-producing acceleration generated by the surroundings of the object ("background geometry") acts on the spin of the object with a force of order R S R S RSR SRS and pulls it away from geodesic motion with an acceleration of the order
(20.48) acceleration ( cm 1 ) R ( cm 2 ) S ( cm 2 ) m ( cm ) (20.48)  acceleration  cm 1 R cm 2 S cm 2 m ( cm ) {:(20.48)" acceleration "(cm^(-1))∼(R(cm^(-2))S(cm^(2)))/(m((cm))):}\begin{equation*} \text { acceleration }\left(\mathrm{cm}^{-1}\right) \sim \frac{R\left(\mathrm{~cm}^{-2}\right) S\left(\mathrm{~cm}^{2}\right)}{m(\mathrm{~cm})} \tag{20.48} \end{equation*}(20.48) acceleration (cm1)R( cm2)S( cm2)m( cm)
(see exercise 40.8). Otherwise stated, the surrounding and the spin both put warps in the geometry, and these warps conspire together to push the object off track.
The sum of the relevant two perturbations in the metric is qualitatively of the form
(20.49) δ g r 2 R + S / r 2 (20.49) δ g r 2 R + S / r 2 {:(20.49)delta g∼r^(2)R+S//r^(2):}\begin{equation*} \delta g \sim r^{2} R+S / r^{2} \tag{20.49} \end{equation*}(20.49)δgr2R+S/r2
The sum is least where r r rrr has a value of the order
(20.50) r ( S / R ) 1 / 4 (20.50) r ( S / R ) 1 / 4 {:(20.50)r∼(S//R)^(1//4):}\begin{equation*} r \sim(S / R)^{1 / 4} \tag{20.50} \end{equation*}(20.50)r(S/R)1/4
and there it has the magnitude
(20.51) δ g ( S R ) 1 / 2 . (20.51) δ g ( S R ) 1 / 2 . {:(20.51)delta g∼(SR)^(1//2).:}\begin{equation*} \delta g \sim(S R)^{1 / 2} . \tag{20.51} \end{equation*}(20.51)δg(SR)1/2.
To "derive the geodesic equation of motion with some preassigned accuracy ϵ ϵ epsilon\epsilonϵ " may be defined to mean that the metric in the buffer zone is Minkowskian within the latitude ϵ ϵ epsilon\epsilonϵ. In the illustrative example, this means that ( S R ) 1 / 2 ( S R ) 1 / 2 (SR)^(1//2)(S R)^{1 / 2}(SR)1/2 is required to be of the order of ϵ ϵ epsilon\epsilonϵ or less. Nothing can be done about the value of R R RRR because the background curvature R R RRR is a feature of the background geometry. One can meet the requirement only by imposing limits on the mass and moments of the object. In the example, where the dominating moment is the angular momentum, one must require that this parameter of the object be less in order of magnitude than the limit
(20.52) S ϵ 2 / R (20.52) S ϵ 2 / R {:(20.52)S∼epsilon^(2)//R:}\begin{equation*} S \sim \epsilon^{2} / R \tag{20.52} \end{equation*}(20.52)Sϵ2/R
Evidently this and similar conditions on the higher moments are most easily satisfied by demanding that the object have spherical symmetry ( S = 0 , t = 0 S = 0 , t = 0 S=0,t=0S=0, t=0S=0,t=0, higher
moments = 0 = 0 =0=0=0 ). Then the perturbation in the metric, again disregarding angle factors and indices, is qualitatively of the form
(20.53) δ g r 2 R + m / r (20.53) δ g r 2 R + m / r {:(20.53)delta g∼r^(2)R+m//r:}\begin{equation*} \delta g \sim r^{2} R+m / r \tag{20.53} \end{equation*}(20.53)δgr2R+m/r
and the buffer zone is best designed to bracket the minimizing value of r r rrr,
(20.54) r Q [ r ( m / R ) 1 / 3 ] r B (20.54) r Q r ( m / R ) 1 / 3 r B {:(20.54)r_(Q) <= [r∼(m//R)^(1//3)] <= r_(B):}\begin{equation*} r_{Q} \leq\left[r \sim(m / R)^{1 / 3}\right] \leq r_{B} \tag{20.54} \end{equation*}(20.54)rQ[r(m/R)1/3]rB
The departure of the metric from Minkowskian perfection in the buffer zone is of the order
(20.55) δ g ( m 2 R ) 1 / 3 (20.55) δ g m 2 R 1 / 3 {:(20.55)delta g∼(m^(2)R)^(1//3):}\begin{equation*} \delta g \sim\left(m^{2} R\right)^{1 / 3} \tag{20.55} \end{equation*}(20.55)δg(m2R)1/3
To achieve any preassigned accuracy ϵ ϵ epsilon\epsilonϵ for δ g δ g delta g\delta gδg, one must demand that the mass be less than a limit of the order
(20.56) m ϵ 3 / 2 / R 1 / 2 (20.56) m ϵ 3 / 2 / R 1 / 2 {:(20.56)m∼epsilon^(3//2)//R^(1//2):}\begin{equation*} m \sim \epsilon^{3 / 2} / R^{1 / 2} \tag{20.56} \end{equation*}(20.56)mϵ3/2/R1/2
No object of finite mass moving under the influence of a complex background will admit a buffer zone where the geometry approaches Minkowskian values with arbitrary precision. Therefore it is incorrect to say that such an object follows a geodesic world line. It is meaningless to say that an object of finite rest mass follows a geodesic world line. World line of what? If the object is a black hole, there is no point inside its "horizon" (capture surface; one-way membrane; see Chapters 33 and 34) that is relevant to the physics going on outside. Geodesic world line within what background geometry? It has no sense to speak of a geometry that "lies behind" or is "background to" a black hole.
Turn from one motion of one object in one spacetime to a continuous one-parameter family of spacetimes, with the mass m m mmm of the object being the parameter that distinguishes one of these solutions of Einstein's field equation from another. Go to the limit m = 0 m = 0 m=0m=0m=0. Then the size of the buffer zone shrinks to zero and the departure of the metric from Minkowskian perfection in the buffer zone also goes to zero. In this limit ("test particle"), it makes sense to say that the object moves in a straight line with uniform velocity in the local inertial frame or, otherwise stated, it pursues a geodesic in the background geometry. Moreover, this background geometry is well-defined: it is the limit of the spacetime geometry as the parameter m m mmm goes to zero [see Infeld and Schild (1949)]. In this sense, the geodesic equation of motion follows as an inescapable consequence of Einstein's field equation.
The concept of "background" as limit of a one-parameter family of spacetimes extends itself to the case where the object bears charge as well as mass, and where the surrounding space is endowed with an electromagnetic field. This time the one-parameter family consists of solutions of the combined Einstein-Maxwell equations. The charge-to-mass ratio e / m e / m e//me / me/m is fixed. The mass m m mmm is again the adjustable parameter. In the limit when m m mmm goes to zero, one is left with (1) a background geometry, (2) a background electromagnetic field, and (3) a world line that obeys
References on derivation of equations of motion from Einstein field equation
Quantum mechanical limitations on the derivation
the general-relativity version of the Lorentz equation of motion in this background as a consequence of the field equations [Chase (1954)]. In contrast, a so-called "unified field theory of gravitation and electromagnetism" that Einstein tentatively put forward at one stage of his thinking, as a conceivable alternative to the combination of his standard 1915 geometrodynamics with Maxwell's standard electrodynamics, has been shown [Callaway (1953)] to lead to the wrong equation of motion for a charged particle. It moves as if uncharged no matter how much charge is piled on its back. If that theory were correct, no cyclotron could operate, no atom could exist, and life itself would be impossible.
Thus the ability to yield the correct equation of motion of a particle has today become an added ace in the hand of general relativity. The idea for such a treatment dates back to Einstein and Grommer (1927). Corrections to the geodesic equation of motion arising from interaction between the spin of the object (when it has finite dimensions) and the curvature of the background geometry are treated by Papapetrou (1951) and more completely by Pirani (1956) (see exercise 40.8). A book on the subject exists [Infeld and Plebanski (1960)]. Section 40.9 describes how corrections to geodesic motion affect lunar and planetary orbits. Some of the problems that arise when the object under study fragments or emits a directional stream of radiation, and unresolved issues of principle, are discussed by Wheeler (1961).
When one turns from the limit of infinitesimal mass to an object of finite mass, no simpler situation presents itself then a system of uncharged black holes (Chapter 33). Everything about the motion of these objects follows from an application of the source-free Einstein equation G = 0 G = 0 G=0\boldsymbol{G}=0G=0 to the region of spacetime outside the horizons (see Chapter 34) of the several objects. The theory of motion is then geometrodynamics and nothing but geometrodynamics.
It has to be emphasized that all the considerations on motion in this section are carried out in the context of classical theory. In the real world of quantum physics, the geometry everywhere experiences unavoidable, natural, zero-point fluctuations (Chapter 43). The calculated local curvatures associated with these fluctuations at the Planck scale of distances [ L = ( G / c 3 ) 1 / 2 = 1.6 × 10 33 cm ] L = G / c 3 1 / 2 = 1.6 × 10 33 cm [L=(ℏG//c^(3))^(1//2)=1.6 xx10^(-33)(cm)]\left[L=\left(\hbar G / c^{3}\right)^{1 / 2}=1.6 \times 10^{-33} \mathrm{~cm}\right][L=(G/c3)1/2=1.6×1033 cm] are enormous [ R 1 / L 2 0.4 × 10 66 cm 2 R 1 / L 2 0.4 × 10 66 cm 2 R∼1//L^(2)∼0.4 xx10^(66)cm^(-2)R \sim 1 / L^{2} \sim 0.4 \times 10^{66} \mathrm{~cm}^{-2}R1/L20.4×1066 cm2 ] compared to the curvature produced on much larger scales by any familiar object (electron or star). No detailed analysis of the interaction of these two curvatures has ever been made. Such an analysis would define a smoothed-out average of the geometry over regions larger than the local quantum fluctuations. With respect to this average geometry, the object will follow geodesic motion: this is the expectation that no one has ever seen any reason to question-but that no one has proved.

EXERCISES

Exercise 20.6. SIMPLE FEATURES OF THE ELECTROMAGNETIC FIELD AND ITS STRESS-ENERGY TENSOR
(a) Show that the "scalar" 1 / 2 F α β F α β 1 / 2 F α β F α β -1//2F_(alpha beta)F^(alpha beta)-1 / 2 F_{\alpha \beta} F^{\alpha \beta}1/2FαβFαβ (invariant with respect to coordinate transformations) and the "pseudoscalar" 1 / 4 F α β F α β 1 / 4 F α β F α β 1//4F_(alpha beta)^(**)F^(alpha beta)1 / 4 F_{\alpha \beta}{ }^{*} F^{\alpha \beta}1/4FαβFαβ (reproduces itself under a coordinate transformation up to a ± ± +-\pm± sign, according as the sign of the Jacobian of the transformation is positive
or negative) have in any local inertial frame the values E 2 B 2 E 2 B 2 E^(2)-B^(2)\boldsymbol{E}^{2}-\boldsymbol{B}^{2}E2B2 and E B E B E*B\boldsymbol{E} \cdot \boldsymbol{B}EB, respectively ("the two Lorentz invariants" of the electromagnetic field).
(b) Show that the Poynting flux ( E × B ) / 4 π ( E × B ) / 4 π (E xx B)//4pi(\boldsymbol{E} \times \boldsymbol{B}) / 4 \pi(E×B)/4π is less in magnitude than the energy density ( E 2 + B 2 ) / 8 π E 2 + B 2 / 8 π (E^(2)+B^(2))//8pi\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) / 8 \pi(E2+B2)/8π, save for the exceptional case where both Lorentz invariants of the field vanish (case where the field is locally "null").
(c) A charged pith ball is located a small distance from the North Pole of a bar magnet. Draw the pattern of electric and magnetic lines of force, indicating where the electromagnetic field is "null" in character. Is it legitimate to say that a "null field" is a "radiation field"?
(d) A plane wave is traveling in the z z zzz-direction. Show that the corresponding electromagnetic field is everywhere null.
(e) Show that the superposition of two monochromatic plane waves traveling in different directions is null on at most a set of points of measure zero.
(f) In the "generic case" where the field ( E , B ) ( E , B ) (E,B)(\boldsymbol{E}, \boldsymbol{B})(E,B) at the point of interest is not null, show that the Poynting flux is reduced to zero by viewing the field from a local inertial frame that is traveling in the direction of E × B E × B E xx B\boldsymbol{E} \times \boldsymbol{B}E×B with a velocity
(20.57) v = tanh α (20.57) v = tanh α {:(20.57)v=tanh alpha:}\begin{equation*} v=\tanh \alpha \tag{20.57} \end{equation*}(20.57)v=tanhα
where the velocity parameter α α alpha\alphaα is given by the formula
(20.58) tanh 2 α = (Poynting flux) (energy density) = 2 | E × B | E 2 + B 2 . (20.58) tanh 2 α =  (Poynting flux)   (energy density)  = 2 | E × B | E 2 + B 2 . {:(20.58)tanh 2alpha=(" (Poynting flux) ")/(" (energy density) ")=(2|E xx B|)/(E^(2)+B^(2)).:}\begin{equation*} \tanh 2 \alpha=\frac{\text { (Poynting flux) }}{\text { (energy density) }}=\frac{2|\boldsymbol{E} \times \boldsymbol{B}|}{\boldsymbol{E}^{2}+\boldsymbol{B}^{2}} . \tag{20.58} \end{equation*}(20.58)tanh2α= (Poynting flux)  (energy density) =2|E×B|E2+B2.
(g) Show that all components of the electric and magnetic field in this new frame can be taken to be zero except E x E x E_(x)E_{x}Ex and B x B x B_(x)B_{x}Bx.
(h) Show that the 4 × 4 4 × 4 4xx44 \times 44×4 determinant built out of the components of the field in mixed representation, F α β F α β F_(alpha)^(beta)F_{\alpha}{ }^{\beta}Fαβ, is invariant with respect to general coordinate transformations. (Hint: Use the theorem that the determinant of the product of three matrices is equal to the product of the determinants of those three matrices.)
(i) Show that this determinant has the value ( E B ) 2 ( E B ) 2 -(E*B)^(2)-(\boldsymbol{E} \cdot \boldsymbol{B})^{2}(EB)2 by evaluating it in the special local inertial frame of (f).
(j) Show that in this special frame the Maxwell stress-energy tensor has the form
(20.59) T μ ν = E x 2 + B x 2 8 π 1 0 0 0 0 1 0 0 0 0 + 1 0 0 0 0 + 1 (20.59) T μ ν = E x 2 + B x 2 8 π 1 0 0 0 0 1 0 0 0 0 + 1 0 0 0 0 + 1 {:(20.59)||T^(mu)_(nu)||=(E_(x)^(2)+B_(x)^(2))/(8pi)||[-1,0,0,0],[0,-1,0,0],[0,0,+1,0],[0,0,0,+1]||:}\left\|T^{\mu}{ }_{\nu}\right\|=\frac{E_{x}{ }^{2}+B_{x}{ }^{2}}{8 \pi}\left\|\begin{array}{rrrr} -1 & 0 & 0 & 0 \tag{20.59}\\ 0 & -1 & 0 & 0 \\ 0 & 0 & +1 & 0 \\ 0 & 0 & 0 & +1 \end{array}\right\|(20.59)Tμν=Ex2+Bx28π1000010000+10000+1
(Faraday tension along the lines of force; Faraday pressure at right angles to the lines of force).
(k) In the other case, where the field is locally null, show that one can always find a local inertial frame in which the field has the form E = ( 0 , F , 0 ) , B = ( 0 , 0 , F ) E = ( 0 , F , 0 ) , B = ( 0 , 0 , F ) E=(0,F,0),B=(0,0,F)\boldsymbol{E}=(0, F, 0), \boldsymbol{B}=(0,0, F)E=(0,F,0),B=(0,0,F) and the stress-energy tensor has the value
(20.60) T μ ν = F 2 4 π 1 1 0 0 1 1 0 0 0 0 0 0 0 0 0 0 ( μ for row, ν for column). (20.60) T μ ν = F 2 4 π 1 1 0 0 1 1 0 0 0 0 0 0 0 0 0 0  (  μ  for row,  ν  for column).  {:(20.60)||T^(mu)_(nu)||=(F^(2))/(4pi)||[-1,1,0,0],[-1,1,0,0],[0,0,0,0],[0,0,0,0]||" ( "mu" for row, "nu" for column). ":}\left\|T^{\mu}{ }_{\nu}\right\|=\frac{F^{2}}{4 \pi}\left\|\begin{array}{rrrr} -1 & 1 & 0 & 0 \tag{20.60}\\ -1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{array}\right\| \text { ( } \mu \text { for row, } \nu \text { for column). }(20.60)Tμν=F24π1100110000000000 ( μ for row, ν for column). 
(1) Regardless of whether the electromagnetic field is or is not null, show that the Maxwell stress-energy tensor has zero trace, T μ μ = 0 T μ μ = 0 T^(mu)_(mu)=0T^{\mu}{ }_{\mu}=0Tμμ=0, and that its square is a multiple of the unit tensor,
T α μ T α ν = δ μ v ( 8 π ) 2 [ ( E 2 B 2 ) 2 + ( 2 E B ) 2 ] (20.61) = δ μ v ( 8 π ) 2 [ ( E 2 + B 2 ) 2 ( 2 E × B ) 2 ] . T α μ T α ν = δ μ v ( 8 π ) 2 E 2 B 2 2 + ( 2 E B ) 2 (20.61) = δ μ v ( 8 π ) 2 E 2 + B 2 2 ( 2 E × B ) 2 . {:[T_(alpha)^(mu)T^(alpha)_(nu)=(delta^(mu)_(v))/((8pi)^(2))[(E^(2)-B^(2))^(2)+(2E*B)^(2)]],[(20.61)=(delta^(mu)_(v))/((8pi)^(2))[(E^(2)+B^(2))^(2)-(2E xx B)^(2)].]:}\begin{align*} T_{\alpha}^{\mu} T^{\alpha}{ }_{\nu} & =\frac{\delta^{\mu}{ }_{v}}{(8 \pi)^{2}}\left[\left(\boldsymbol{E}^{2}-\boldsymbol{B}^{2}\right)^{2}+(2 \boldsymbol{E} \cdot \boldsymbol{B})^{2}\right] \\ & =\frac{\delta^{\mu}{ }_{v}}{(8 \pi)^{2}}\left[\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right)^{2}-(2 \boldsymbol{E} \times \boldsymbol{B})^{2}\right] . \tag{20.61} \end{align*}TαμTαν=δμv(8π)2[(E2B2)2+(2EB)2](20.61)=δμv(8π)2[(E2+B2)2(2E×B)2].

Exercise 20.7. THE STRESS-ENERGY TENSOR DETERMINES THE ELECTROMAGNETIC FIELD EXCEPT FOR ITS COMPLEXION

(a) Given a non-zero symmetric 4 × 4 4 × 4 4xx44 \times 44×4 tensor T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν which has zero trace T μ μ = 0 T μ μ = 0 T^(mu)_(mu)=0T^{\mu}{ }_{\mu}=0Tμμ=0 and whose square is a multiple, M 4 / ( 8 π ) 2 M 4 / ( 8 π ) 2 M^(4)//(8pi)^(2)M^{4} /(8 \pi)^{2}M4/(8π)2, of the unit matrix, show that, according as this multiple is zero ("null case") or positive, the tensor can be transformed to the form (20.60) or (20.59) by a suitable rotation in 3 -space or by a suitable choice of local inertial frame, respectively.
(b) In the generic (non-null) case in the frame in question, show that T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν is the Maxwell tensor of the "extremal electromagnetic field" ξ μ ν ξ μ ν xi_(mu nu)\xi_{\mu \nu}ξμν with components
E (extremal) = ( M , 0 , 0 ) , (20.62) B (extremal) = ( 0 , 0 , 0 ) E (extremal)  = ( M , 0 , 0 ) , (20.62) B (extremal)  = ( 0 , 0 , 0 ) {:[E^((extremal) )=(M","0","0)","],[(20.62)B^((extremal) )=(0","0","0)]:}\begin{align*} \boldsymbol{E}^{\text {(extremal) }} & =(M, 0,0), \\ \boldsymbol{B}^{\text {(extremal) }} & =(0,0,0) \tag{20.62} \end{align*}E(extremal) =(M,0,0),(20.62)B(extremal) =(0,0,0)
Show that it is also the Maxwell tensor of the "dual extremal field" ξ μ ν ξ μ ν ^(**)xi_(mu nu){ }^{*} \xi_{\mu \nu}ξμν with components
E ( extremal ) = ( 0 , 0 , 0 ) , (20.63) B (extremal) = ( M , 0 , 0 ) E ( extremal  ) = ( 0 , 0 , 0 ) , (20.63) B (extremal)  = ( M , 0 , 0 ) {:[^(**)E^(("extremal "))=(0","0","0)","],[(20.63)^(**)B^((extremal) )=(M","0","0)]:}\begin{align*} & { }^{*} \boldsymbol{E}^{(\text {extremal })}=(0,0,0), \\ & { }^{*} \boldsymbol{B}^{\text {(extremal) }}=(M, 0,0) \tag{20.63} \end{align*}E(extremal )=(0,0,0),(20.63)B(extremal) =(M,0,0)
(c) Recalling that the duality operation * applied twice to an antisymmetric second-rank tensor (2-form) in four-dimensional space leads back to the negative of that tensor, show that the operator e α e α e^(**alpha)e^{* \alpha}eα ("duality rotation") has the value
(20.64) e α = ( cos α ) + ( sin α ) (20.64) e α = ( cos α ) + ( sin α ) {:(20.64)e^(**alpha)=(cos alpha)+(sin alpha)^(**):}\begin{equation*} e^{* \alpha}=(\cos \alpha)+(\sin \alpha)^{*} \tag{20.64} \end{equation*}(20.64)eα=(cosα)+(sinα)
(d) Show that the most general electromagnetic field which will reproduce the non-null tensor T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν in the frame in question, and therefore in any coordinate system, is
(20.65) F μ ν = e α ξ μ v . (20.65) F μ ν = e α ξ μ v . {:(20.65)F_(mu nu)=e^(**alpha)xi_(mu v).:}\begin{equation*} F_{\mu \nu}=e^{* \alpha} \xi_{\mu v} . \tag{20.65} \end{equation*}(20.65)Fμν=eαξμv.
(e) Derive a corresponding result for the null case. [The field F μ ν F μ ν F_(mu nu)F_{\mu \nu}Fμν defined in the one frame and therefore in every coordinate system by (d) and (e) is known as the "Maxwell square root" of T μ ν ; ξ μ ν T μ ν ; ξ μ ν T^(mu nu);xi_(mu nu)T^{\mu \nu} ; \xi_{\mu \nu}Tμν;ξμν is known as the "extremal Maxwell square root" of T μ ν T μ ν T^(mu nu)T^{\mu \nu}Tμν; and the angle α α alpha\alphaα is called the "complexion of the electromagnetic field." See Misner and Wheeler (1957); see also Boxes 20.1 and 20.2, adapted from that paper.]

Box 20.1 CONTRAST BETWEEN PROPER LORENTZ TRANSFORMATION AND DUALITY ROTATION

Quantity
General proper
Lorentz transformation
General proper Lorentz transformation| General proper | | :---: | | Lorentz transformation |
Duality
rotation
Duality rotation| Duality | | :---: | | rotation |
Components of the Maxwell stress-energy tensor or
the "Maxwell square" of the field F F F\boldsymbol{F}F
Components of the Maxwell stress-energy tensor or the "Maxwell square" of the field F| Components of the Maxwell stress-energy tensor or | | :--- | | the "Maxwell square" of the field $\boldsymbol{F}$ |
Transformed Unchanged
The invariants E 2 B 2 E 2 B 2 E^(2)-B^(2)\boldsymbol{E}^{2}-\boldsymbol{B}^{2}E2B2 and ( E B ) 2 ( E B ) 2 (E*B)^(2)(\boldsymbol{E} \cdot \boldsymbol{B})^{2}(EB)2 Unchanged Transformed
The combination [ ( E 2 B 2 ) 2 + ( 2 E B ) 2 ] = E 2 B 2 2 + ( 2 E B ) 2 = [(E^(2)-B^(2))^(2)+(2E*B)^(2)]=\left[\left(\boldsymbol{E}^{2}-\boldsymbol{B}^{2}\right)^{2}+(2 \boldsymbol{E} \cdot \boldsymbol{B})^{2}\right]=[(E2B2)2+(2EB)2]= Unchanged
[ ( E 2 + B 2 ) 2 ( 2 E × B ) 2 ] E 2 + B 2 2 ( 2 E × B ) 2 [(E^(2)+B^(2))^(2)-(2E xx B)^(2)]\left[\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right)^{2}-(2 \boldsymbol{E} \times \boldsymbol{B})^{2}\right][(E2+B2)2(2E×B)2]
The combination [(E^(2)-B^(2))^(2)+(2E*B)^(2)]= Unchanged [(E^(2)+B^(2))^(2)-(2E xx B)^(2)] | The combination $\left[\left(\boldsymbol{E}^{2}-\boldsymbol{B}^{2}\right)^{2}+(2 \boldsymbol{E} \cdot \boldsymbol{B})^{2}\right]=$ | Unchanged | | :--- | :--- | | $\left[\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right)^{2}-(2 \boldsymbol{E} \times \boldsymbol{B})^{2}\right]$ | |
Quantity "General proper Lorentz transformation" "Duality rotation" "Components of the Maxwell stress-energy tensor or the "Maxwell square" of the field F" Transformed Unchanged The invariants E^(2)-B^(2) and (E*B)^(2) Unchanged Transformed "The combination [(E^(2)-B^(2))^(2)+(2E*B)^(2)]= Unchanged [(E^(2)+B^(2))^(2)-(2E xx B)^(2)] " | Quantity | General proper <br> Lorentz transformation | Duality <br> rotation | | :--- | :--- | ---: | | Components of the Maxwell stress-energy tensor or <br> the "Maxwell square" of the field $\boldsymbol{F}$ | Transformed | Unchanged | | The invariants $\boldsymbol{E}^{2}-\boldsymbol{B}^{2}$ and $(\boldsymbol{E} \cdot \boldsymbol{B})^{2}$ | Unchanged | Transformed | | The combination $\left[\left(\boldsymbol{E}^{2}-\boldsymbol{B}^{2}\right)^{2}+(2 \boldsymbol{E} \cdot \boldsymbol{B})^{2}\right]=$ Unchanged <br> $\left[\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right)^{2}-(2 \boldsymbol{E} \times \boldsymbol{B})^{2}\right]$ | | |

Box 20.2 TRANSFORMATION OF THE GENERIC (NON-NULL) ELECTROMAGNETIC FIELD TENSOR F = ( E , B ) F = ( E , B ) F=(E,B)F=(E, B)F=(E,B) IN A LOCAL INERTIAL FRAME

Field values At start
After simplifying
duality rotation
After simplifying duality rotation| After simplifying | | :---: | | duality rotation |
At start E , B E , B E,B\boldsymbol{E}, \boldsymbol{B}E,B
E E E\boldsymbol{E}E and B B B\boldsymbol{B}B perpendicular, and E E E\boldsymbol{E}E
greater than B B B\boldsymbol{B}B
E and B perpendicular, and E greater than B| $\boldsymbol{E}$ and $\boldsymbol{B}$ perpendicular, and $\boldsymbol{E}$ | | :--- | | greater than $\boldsymbol{B}$ |
After simplifying Lorentz
transformation
After simplifying Lorentz transformation| After simplifying Lorentz | | :--- | | transformation |
E E E\boldsymbol{E}E and B B B\boldsymbol{B}B parallel to each other
and parallel to x x xxx-axis
E and B parallel to each other and parallel to x-axis| $\boldsymbol{E}$ and $\boldsymbol{B}$ parallel to each other | | :--- | | and parallel to $x$-axis |
E E E\boldsymbol{E}E parallel to x x xxx-axis and B = 0 B = 0 B=0\boldsymbol{B}=0B=0
Field values At start "After simplifying duality rotation" At start E,B "E and B perpendicular, and E greater than B" "After simplifying Lorentz transformation" "E and B parallel to each other and parallel to x-axis" E parallel to x-axis and B=0| Field values | At start | After simplifying <br> duality rotation | | :--- | :--- | :--- | | At start | $\boldsymbol{E}, \boldsymbol{B}$ | $\boldsymbol{E}$ and $\boldsymbol{B}$ perpendicular, and $\boldsymbol{E}$ <br> greater than $\boldsymbol{B}$ | | After simplifying Lorentz <br> transformation | $\boldsymbol{E}$ and $\boldsymbol{B}$ parallel to each other <br> and parallel to $x$-axis | $\boldsymbol{E}$ parallel to $x$-axis and $\boldsymbol{B}=0$ |

Exercise 20.8. THE MAXWELL EQUATIONS CANNOT BE DERIVED FROM THE LAW OF CONSERVATION OF STRESS-ENERGY WHEN ( E B ) = 0 ( E B ) = 0 (E*B)=0(E \cdot B)=0(EB)=0 OVER AN EXTENDED REGION

Supply a counter-example to the idea that the Maxwell equations,
F ; ν μ ν = 0 F ; ν μ ν = 0 F_(;nu)^(mu nu)=0F_{; \nu}^{\mu \nu}=0F;νμν=0
follow from the Einstein equation; or, more precisely, show that (1) the condition that the Maxwell stress-energy tensor should have a vanishing divergence plus (2) the condition that this Maxwell field is the curl of a 4-potential A μ A μ A_(mu)A_{\mu}Aμ can both be satisfied, while yet the stated Maxwell equations are violated. [Hint: It simplifies the analysis without obscuring the main point to consider the problem in the context of flat spacetime. Refer to the paper of Teitelboim (1970) for the decomposition of the retarded field of an arbitrarily accelerated charge into two parts, of which the second, there called F μ ν I I F μ ν I I F^(mu nu)_(II)F^{\mu \nu}{ }_{I I}FμνII, meets the staped requirements, and has everywhere off the worldline ( E B ) = 0 ( E B ) = 0 (E*B)=0(\boldsymbol{E} \cdot \boldsymbol{B})=0(EB)=0, but does not satisfy thi ted Maxwell equations.]

Exercise 20.9. EQUATION OF MOTION OF A SCALAR F.L_D AS

CONSEQUENCE OF THE EINSTEIN FIELD EQUATION
The stress-energy tensor of a massless scalar field is taken to be
(20.66) T μ ν = ( 1 / 4 π ) ( ϕ , μ ϕ , ν 1 / 2 g μ ν ϕ , α ϕ α ) (20.66) T μ ν = ( 1 / 4 π ) ϕ , μ ϕ , ν 1 / 2 g μ ν ϕ , α ϕ α {:(20.66)T_(mu nu)=(1//4pi)(phi_(,mu)phi_(,nu)-1//2g_(mu nu)phi_(,alpha)phi^(alpha)):}\begin{equation*} T_{\mu \nu}=(1 / 4 \pi)\left(\phi_{, \mu} \phi_{, \nu}-1 / 2 g_{\mu \nu} \phi_{, \alpha} \phi^{\alpha}\right) \tag{20.66} \end{equation*}(20.66)Tμν=(1/4π)(ϕ,μϕ,ν1/2gμνϕ,αϕα)
Derive the equation of motion of this scalar field from Einstein's field equation.
снартев 21

VARIATIONAL PRINCIPLE AND INITIAL-VALUE DATA

Whenever any action occurs in nature, the quantity of action employed by this change is the least possible. PIERRE MOREAU DE MAUPERTUIS (1746)
In the theory of gravitation, as in all other branches of theoretical physics, a mathematically correct statement of a problem must be determinate to the extent allowed by the nature of the problem; if possible, it must ensure the uniqueness of its solution.
VLADIMIR ALEXANDROVITCH FOCK (1959)
Things are as they are because they were as they were.
THOMAS GOLD (1972)
Calculemus
G. W. LEIBNIZ
This chapter is entirely Track 2. No earlier Track-2 material is needed as preparation for it, but Chapters 9-11 and 13-15 will be helpful. It is needed as preparation for Box 30.1 (mixmaster universe) and for Chapters 42 and 43.

§21.1. DYNAMICS REQUIRES INITIAL-VALUE DATA

No plan for predicting the dynamics of geometry could be at the same time more mistaken and more right than this: "Give the distribution of mass-energy; then solve Einstein's second-order equation,
(21.1) G = 8 π T , (21.1) G = 8 π T , {:(21.1)G=8pi T",":}\begin{equation*} \boldsymbol{G}=8 \pi \boldsymbol{T}, \tag{21.1} \end{equation*}(21.1)G=8πT,
for the geometry." Give the distribution of mass-energy in spacetime and solve for the spacetime geometry? No. Give the fields that generate mass-energy, and their
time-rates of change, and give 3-geometry of space and its time-rate of change, all at one time, and solve for the 4 -geometry of spacetime at that one time? Yes. And only then let one's equations for geometrodynamics and field dynamics go on to predict for all time, in and by themselves, needing no further prescriptions from outside (needing only work!), both the spacetime geometry and the flow of massenergy throughout this spacetime. This, in brief, is the built-in "plan" of geometrodynamics, the plan spelled out in more detail in this chapter.
Contest the plan. Point out that the art of solving any coupled set of equations lies in separating the unknowns from what is known or to be prescribed. Insist that this separation is already made in (21.1). On the right already stands the source of curvature. On the left already stands the receptacle of curvature in the form of what one wants to know, the metric coefficients, twice differentiated. Claim therefore that one has nothing to do except to go ahead and solve these equations for the metric coefficients. However, in analyzing the structure of the equations to greater depth [see Cartan (1922a) for the rationale of analyzing a coupled set of partial differential equations], one discovers that one can only make the split between "the source and the receptacle" in the right way when one has first recognized the still more important split between "the initial-value data and the future." Thus-to summarize the results before doing the analysis-four of the ten components of Einstein's law connect the curvature of space here and now with the distribution of mass-energy here and now, and the other six equations tell how the geometry as thus determined then proceeds to evolve.
In determining what are appropriate initial-value data to give, one discovers no guide more useful than the Hilbert variational principle,
(21.2) I = E d 4 x = L ( g ) 1 / 2 d 4 x = ¢exercise 8.16] L d ( proper 4-volume ) = extremum (21.2) I = E d 4 x = L ( g ) 1 / 2 d 4 x =  ¢exercise 8.16]  L d (  proper 4-volume  ) =  extremum  {:(21.2)I=intEd^(4)x=int L(-g)^(1//2)d^(4)x=int_(" ¢exercise 8.16] ")Ld(" proper 4-volume ")=" extremum ":}\begin{equation*} I=\int \mathcal{E} d^{4} x=\int L(-g)^{1 / 2} d^{4} x=\underset{\text { ¢exercise 8.16] }}{\int} L d(\text { proper 4-volume })=\text { extremum } \tag{21.2} \end{equation*}(21.2)I=Ed4x=L(g)1/2d4x= ¢exercise 8.16] Ld( proper 4-volume )= extremum 
or the Arnowitt-Deser-Misner ("ADM") variant of it (§21.6) and generalizations thereof by Kuchař (§21.9). Out of this principle one can recognize most directly what one must hold fixed at the limits (on an initial spacelike hypersurface and on a final spacelike hypersurface) as one varies the geometry ($21.2) throughout the spacetime "filling of this sandwich," if one is to have a well-defined extremum problem.
The Lagrange function L L LLL (scalar function) or the Lagrangian density L = L = L=\mathcal{L}=L= ( g ) 1 / 2 L ( g ) 1 / 2 L (-g)^(1//2)L(-g)^{1 / 2} L(g)1/2L (quantity to be integrated over coordinate volume) is built of geometry alone, when one deals with curved empty space, but normally fields are present as well, and contribute also to the Lagrangian; thus,
(21.3) L = E geom + L field = ( g ) 1 / 2 L ; L = L geom + L field (21.3) L = E geom  + L field  = ( g ) 1 / 2 L ; L = L geom  + L field  {:[(21.3)L=E_("geom ")+L_("field ")=(-g)^(1//2)L;],[L=L_("geom ")+L_("field ")]:}\begin{gather*} \mathcal{L}=\mathcal{E}_{\text {geom }}+\mathcal{L}_{\text {field }}=(-g)^{1 / 2} L ; \tag{21.3}\\ L=L_{\text {geom }}+L_{\text {field }} \end{gather*}(21.3)L=Egeom +Lfield =(g)1/2L;L=Lgeom +Lfield 
The variation of the field Lagrangian with respect to the typical metric coefficient proves to be, of all ways, the one most convenient for generating (that is, for calculating) the corresponding component of the symmetric stress-energy tensor of the field ( $ 21.3 $ 21.3 $21.3\$ 21.3$21.3 ).
Give initial data, predict geometry
Four of ten components of Einstein equation are conditions on initial-value data
A computer, allowing for the effect of this field on the geometry and computing ahead from instant to instant the evolution of the metric with time, imposes its own ordering on the events of spacetime. In effect, it slices spacetime into a great number of spacelike slices. It finds it most convenient ( $ 21.4 $ 21.4 $21.4\$ 21.4$21.4 ) to do separate bookkeeping on (1) the 3 -geometry of the individual slices and (2) the relation between one such slice and the next, as expressed in a "lapse function" N N NNN and a 3 -vector "shift function" N i N i N_(i)N_{i}Ni.
The 3-geometry internal to the individual slice or "simultaneity" defines in and by itself the three-dimensional Riemannian curvature intrinsic to this hypersurface; but for a complete account of this hypersurface one must know also the extrinsic curvature ( $ 21.5 $ 21.5 $21.5\$ 21.5$21.5 ) telling how this hypersurface is curved with respect to the enveloping four-dimensional spacetime manifold.
In terms of the space-plus-time split of the 4 -geometry, the action principle of Hilbert takes a simple and useful form (§21.6).
In the most elementary example of the application of an action principle in mechanics, where one writes
(21.4) I = x , t x , t L ( d x / d t , x , t ) d t (21.4) I = x , t x , t L ( d x / d t , x , t ) d t {:(21.4)I=int_(x^('),t^('))^(x,t)L(dx//dt","x","t)dt:}\begin{equation*} I=\int_{x^{\prime}, t^{\prime}}^{x, t} L(d x / d t, x, t) d t \tag{21.4} \end{equation*}(21.4)I=x,tx,tL(dx/dt,x,t)dt
and extremizes the integral, one already knows that the resultant "dynamic path length" or "dynamic phase" or "action,"
(21.5) S ( x , t ) = I extremum (21.5) S ( x , t ) = I extremum  {:(21.5)S(x","t)=I_("extremum "):}\begin{equation*} S(x, t)=I_{\text {extremum }} \tag{21.5} \end{equation*}(21.5)S(x,t)=Iextremum 
is an important quantity, not least because it gives (up to a factor \hbar ) the phase of the quantum-mechanical wave function. Moreover, the rate of change of this action function with position is what one calls momentum,
(21.6) p = S ( x , t ) / x ; (21.6) p = S ( x , t ) / x ; {:(21.6)p=del S(x","t)//del x;:}\begin{equation*} p=\partial S(x, t) / \partial x ; \tag{21.6} \end{equation*}(21.6)p=S(x,t)/x;
and the (negative of the) rate of change with time gives energy (Figure 21.1),
(21.7) E = S ( x , t ) / t ; (21.7) E = S ( x , t ) / t ; {:(21.7)E=-del S(x","t)//del t;:}\begin{equation*} E=-\partial S(x, t) / \partial t ; \tag{21.7} \end{equation*}(21.7)E=S(x,t)/t;
and the relation between these two features of a system of wave crests,
(21.8) E = H ( p , x ) (21.8) E = H ( p , x ) {:(21.8)E=H(p","x):}\begin{equation*} E=H(p, x) \tag{21.8} \end{equation*}(21.8)E=H(p,x)
call it "dispersion relation" or call it what one will, is the central topic of mechanics.
When dealing with the dynamics of geometry in the Arnowitt-Deser-Misner formulation,* one finds it convenient to think of the specified quantities as being
Figure 21.1.
Momentum and (the negative of the) energy viewed as rate of change of "dynamic phase" or "action,"
(1) S ( x , t ) = I extremum ( x , t ) = ( extremum value of ) x , t x , t L ( x , x ˙ , t ) d t (1) S ( x , t ) = I extremum  ( x , t ) = (  extremum   value of  ) x , t x , t L ( x , x ˙ , t ) d t {:(1)S(x","t)=I_("extremum ")(x","t)=((" extremum ")/(" value of "))int_(x^('),t^('))^(x,t)L(x","x^(˙)","t)dt:}\begin{equation*} S(x, t)=I_{\text {extremum }}(x, t)=\binom{\text { extremum }}{\text { value of }} \int_{x^{\prime}, t^{\prime}}^{x, t} L(x, \dot{x}, t) d t \tag{1} \end{equation*}(1)S(x,t)=Iextremum (x,t)=( extremum  value of )x,tx,tL(x,x˙,t)dt
with respect to position and time; thus,
(2) δ S = p δ x E δ t . (2) δ S = p δ x E δ t . {:(2)delta S=p delta x-E delta t.:}\begin{equation*} \delta S=p \delta x-E \delta t . \tag{2} \end{equation*}(2)δS=pδxEδt.
The variation of the integral I I III with respect to changes of the history along the way, δ x ( t ) δ x ( t ) delta x(t)\delta x(t)δx(t), is already zero by reason of the optimization of the history; so the only change that takes place is
δ S = δ I extremum = L ( x , x ˙ , t ) δ t + x , t x + Δ x , t δ L d t = L δ t + x , t x + Δ x , t ( L x ˙ δ x ˙ + L x δ x ) d t (3) = L δ t + L x ˙ Δ x + x , t x + Δ x , t ( L x d d t L d x ˙ ) δ x d t . [ zero by reason of extremization ] δ S = δ I extremum  = L ( x , x ˙ , t ) δ t + x , t x + Δ x , t δ L d t = L δ t + x , t x + Δ x , t L x ˙ δ x ˙ + L x δ x d t (3) = L δ t + L x ˙ Δ x + x , t x + Δ x , t L x d d t L d x ˙ δ x d t .  zero by reason   of extremization  {:[delta S=deltaI_("extremum ")=L(x","x^(˙)","t)delta t+int_(x^('),t^('))^(x+Delta x,t)delta Ldt],[=L delta t+int_(x^('),t^('))^(x+Delta x,t)((del L)/(del(x^(˙)))delta(x^(˙))+(del L)/(del x)delta x)dt],[(3)=L delta t+(del L)/(del(x^(˙)))Delta x+int_(x^('),t^('))^(x+Delta x,t)ubrace(((del L)/(del x)-(d)/(dt)(del L)/(d(x^(˙))))ubrace) delta xdt.],[[[" zero by reason "],[" of extremization "]]]:}\begin{align*} \delta S & =\delta I_{\text {extremum }}=L(x, \dot{x}, t) \delta t+\int_{x^{\prime}, t^{\prime}}^{x+\Delta x, t} \delta L d t \\ & =L \delta t+\int_{x^{\prime}, t^{\prime}}^{x+\Delta x, t}\left(\frac{\partial L}{\partial \dot{x}} \delta \dot{x}+\frac{\partial L}{\partial x} \delta x\right) d t \\ & =L \delta t+\frac{\partial L}{\partial \dot{x}} \Delta x+\int_{x^{\prime}, t^{\prime}}^{x+\Delta x, t} \underbrace{\left(\frac{\partial L}{\partial x}-\frac{d}{d t} \frac{\partial L}{d \dot{x}}\right)} \delta x d t . \tag{3}\\ & {\left[\begin{array}{l} \text { zero by reason } \\ \text { of extremization } \end{array}\right] } \end{align*}δS=δIextremum =L(x,x˙,t)δt+x,tx+Δx,tδLdt=Lδt+x,tx+Δx,t(Lx˙δx˙+Lxδx)dt(3)=Lδt+Lx˙Δx+x,tx+Δx,t(LxddtLdx˙)δxdt.[ zero by reason  of extremization ]
When one contemplates only a change δ x δ x delta x\delta xδx in the coordinates ( x , t ) ( x , t ) (x,t)(x, t)(x,t) of the end point (change of history from O P O P OP\mathcal{O P}OP to O Q O Q OQ\mathcal{O Q}OQ ), one has Δ x = δ x Δ x = δ x Delta x=delta x\Delta x=\delta xΔx=δx. When one makes only a change δ t δ t delta t\delta tδt in the end point (change of history from O P O P OP\mathcal{O P}OP to O S O S OS\mathcal{O S}OS ), one has Δ x = Δ x = Delta x=\Delta x=Δx= (indicator of change from P P P\mathscr{P}P to R ) = R ) = R)=\mathscr{R})=R)= x ˙ δ t x ˙ δ t -x^(˙)delta t-\dot{x} \delta tx˙δt. For the general variation of the final point, one thus has Δ x = δ x x ˙ δ t Δ x = δ x x ˙ δ t Delta x=delta x-x^(˙)delta t\Delta x=\delta x-\dot{x} \delta tΔx=δxx˙δt and
(4) δ S = L x ˙ δ x ( x ˙ L x ˙ L ) δ t . (4) δ S = L x ˙ δ x x ˙ L x ˙ L δ t . {:(4)delta S=(del L)/(del(x^(˙)))delta x-((x^(˙))(del L)/(del(x^(˙)))-L)delta t.:}\begin{equation*} \delta S=\frac{\partial L}{\partial \dot{x}} \delta x-\left(\dot{x} \frac{\partial L}{\partial \dot{x}}-L\right) \delta t . \tag{4} \end{equation*}(4)δS=Lx˙δx(x˙Lx˙L)δt.
One concludes that the "dispersion relation" is obtained by taking the relations [compare (2) and (4)]
(5) ( rate of change of dynamic phase with position ) = ( momentum ) = p = L ( x , x ˙ , t ) x ˙ (5)  rate of change of   dynamic phase   with position  = (  momentum  ) = p = L ( x , x ˙ , t ) x ˙ {:(5)([" rate of change of "],[" dynamic phase "],[" with position "])=(" momentum ")=p=(del L(x,(x^(˙)),t))/(del(x^(˙))):}\left(\begin{array}{l} \text { rate of change of } \tag{5}\\ \text { dynamic phase } \\ \text { with position } \end{array}\right)=(\text { momentum })=p=\frac{\partial L(x, \dot{x}, t)}{\partial \dot{x}}(5)( rate of change of  dynamic phase  with position )=( momentum )=p=L(x,x˙,t)x˙
and
(6) ( rate of change of dynamic phase with time ) = ( energy ) = E = x ˙ L x ˙ L , (6)  rate of change of   dynamic phase   with time  = (  energy  ) = E = x ˙ L x ˙ L , {:(6)-([" rate of change of "],[" dynamic phase "],[" with time "])=(" energy ")=E=x^(˙)(del L)/(del(x^(˙)))-L",":}-\left(\begin{array}{l} \text { rate of change of } \tag{6}\\ \text { dynamic phase } \\ \text { with time } \end{array}\right)=(\text { energy })=E=\dot{x} \frac{\partial L}{\partial \dot{x}}-L,(6)( rate of change of  dynamic phase  with time )=( energy )=E=x˙Lx˙L,
and eliminating x ˙ x ˙ x^(˙)\dot{x}x˙ from them [solve (5) for x ˙ x ˙ x^(˙)\dot{x}x˙ and substitute that value of x ˙ x ˙ x^(˙)\dot{x}x˙ into (6)]; thus
(7) E = H ( p , x , t ) (7) E = H ( p , x , t ) {:(7)E=H(p","x","t):}\begin{equation*} E=H(p, x, t) \tag{7} \end{equation*}(7)E=H(p,x,t)
or
(8) S t = H ( S x , x , t ) (8) S t = H S x , x , t {:(8)-(del S)/(del t)=H((del S)/(del x),x,t):}\begin{equation*} -\frac{\partial S}{\partial t}=H\left(\frac{\partial S}{\partial x}, x, t\right) \tag{8} \end{equation*}(8)St=H(Sx,x,t)
Every feature of this elementary analysis has its analog in geometrodynamics.
a coordinate-free geometric-physical quantity. The great payoff of this work was recognition of the lapse and shift functions of equation (21.40) as Lagrange multipliers, the coefficients of which gave directly and simply Dirac's constraints. They did not succeed in arriving at a natural and simple time-coordinate, but that goal has in the meantime been achieved in the "extrinsic time" of Kuchar and York ( $ 21.11 $ 21.11 $21.11\$ 21.11$21.11 ). However, the Arnowitt-Deser Misner approach opened the door to the "intrinsic time" of Sharp, Baierlein, and Wheeler, where 3-geometry is fixed at limits, and 3-geometry is the carrier of information about time; and this led directly to Wheeler's "superspace version" of the treatment of Arnowitt, Deser, and Misner.
the 3 -geometry ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y of the initial spacelike hypersurface and the 3 -geometry ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y of the final spacelike hypersurface. One envisages the action integral as extremized with respect to the choice of the spacetime that fills the "sandwich" between these two faces. If one has thus determined the spacetime, one has automatically by that very act determined the separation in proper time of the two hypersurfaces. There is no additional time-variable to be brought in or considered. The one concept ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y thus takes the place in geometrodynamics of the two quantities x , t x , t x,tx, tx,t of particle dynamics. The action S S SSS that there depended on x x xxx and t t ttt here depends on the 3 -geometry of the face of the sandwich; thus,
(21.9) S = S ( ( 3 ) y ) . (21.9) S = S ( 3 ) y . {:(21.9)S=S(^((3))y).:}\begin{equation*} S=S\left(^{(3)} y\right) . \tag{21.9} \end{equation*}(21.9)S=S((3)y).
A change in the 3-geometry changes the action. The amount of the change in action per elementary change in 3-geometry defines the "field momentum" π true i j π true  i j pi_("true ")^(ij)\pi_{\text {true }}^{i j}πtrue ij conjugate to the geometrodynamic field coordinate g i j g i j g_(ij)g_{i j}gij, according to the formula
(21.10) δ S = π true i j δ g i j d 3 x (21.10) δ S = π true  i j δ g i j d 3 x {:(21.10)delta S=intpi_("true ")^(ij)deltag_(ij)d^(3)x:}\begin{equation*} \delta S=\int \pi_{\text {true }}^{i j} \delta g_{i j} d^{3} x \tag{21.10} \end{equation*}(21.10)δS=πtrue ijδgijd3x
Comparing this equation out of the Arnowitt, Deser, and Misner (ADM) canonical formulation of geometrodynamics ( $ 21.7 $ 21.7 $21.7\$ 21.7$21.7 ) with the expression for change of action with change of endpoint in elementary mechanics,
(21.11) δ S = p δ x E δ t (21.11) δ S = p δ x E δ t {:(21.11)delta S=p delta x-E delta t:}\begin{equation*} \delta S=p \delta x-E \delta t \tag{21.11} \end{equation*}(21.11)δS=pδxEδt
one might at first think that something is awry, there being no obvious reference to time in (21.10). However, the 3-geometry is itself automatically the carrier of information about time; and (21.10) is complete. Moreover, with no "time" variable other than the information that ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) y itself already carries about time, there is also no "energy." Thus the "dispersion relation" that connects the rates of change of action with respect to the several changes that one can make in the "field coordinates" or 3-geometry takes the form
(21.12) K ( π i j , g m n ) = 0 , (21.12) K π i j , g m n = 0 , {:(21.12)K(pi^(ij),g_(mn))=0",":}\begin{equation*} \mathscr{K}\left(\pi^{i j}, g_{m n}\right)=0, \tag{21.12} \end{equation*}(21.12)K(πij,gmn)=0,
with the E-term of (21.8) equal to zero (details in §21.7). All the content of Einstein's general relativity can be extracted from this one Hamiltonian, or "super-Hamiltonian," to give it a more appropriate name [see DeWitt (1967a), pp. 1113-1118, for an account of the contributions of Dirac, of Arnowitt, Deser, and Misner, and of others to the Hamiltonian formulation of geometrodynamics; and see § 21.7 § 21.7 §21.7\S 21.7§21.7 and subsequent sections of this chapter for the meaning and payoffs of this formulation].
The difference between a Hamiltonian and a super-Hamiltonian [see, for example, Kramers (1957)] shows nowhere more clearly than in the problem of a charged particle moving in flat space under the influence of the field derived from the electromagnetic 4-potential, A μ ( x α ) A μ x α A_(mu)(x^(alpha))A_{\mu}\left(x^{\alpha}\right)Aμ(xα). The Hamiltonian treatment derives the equation of motion from the action principle,
0 = δ I = δ [ p i d x i d t H ( p j , x k , t ) ] d t 0 = δ I = δ p i d x i d t H p j , x k , t d t 0=delta I=delta int[p_(i)(dx^(i))/(dt)-H(p_(j),x^(k),t)]dt0=\delta I=\delta \int\left[p_{i} \frac{d x^{i}}{d t}-H\left(p_{j}, x^{k}, t\right)\right] d t0=δI=δ[pidxidtH(pj,xk,t)]dt
with
H = e c ϕ + [ m 2 + η i j ( p i + e c A i ) ( p j + e c A j ) ] 1 / 2 H = e c ϕ + m 2 + η i j p i + e c A i p j + e c A j 1 / 2 H=-(e)/(c)phi+[m^(2)+eta^(ij)(p_(i)+(e)/(c)A_(i))(p_(j)+(e)/(c)A_(j))]^(1//2)H=-\frac{e}{c} \phi+\left[m^{2}+\eta^{i j}\left(p_{i}+\frac{e}{c} A_{i}\right)\left(p_{j}+\frac{e}{c} A_{j}\right)\right]^{1 / 2}H=ecϕ+[m2+ηij(pi+ecAi)(pj+ecAj)]1/2
The super-Hamiltonian analysis gets the equations of motion from the action principle
0 = δ I = δ [ p μ d x μ d λ H ( p α , x β ) ] d λ 0 = δ I = δ p μ d x μ d λ H p α , x β d λ 0=deltaI^(')=delta int[p_(mu)(dx^(mu))/(d lambda)-H(p_(alpha),x^(beta))]d lambda0=\delta I^{\prime}=\delta \int\left[p_{\mu} \frac{d x^{\mu}}{d \lambda}-\mathscr{H}\left(p_{\alpha}, x^{\beta}\right)\right] d \lambda0=δI=δ[pμdxμdλH(pα,xβ)]dλ
Here the super-Hamiltonian is given by the expression
K ( p α , x β ) = 1 2 [ m 2 + η μ ν ( p μ + e c A μ ) ( p ν + e c A ν ) ] . K p α , x β = 1 2 m 2 + η μ ν p μ + e c A μ p ν + e c A ν . K(p_(alpha),x^(beta))=(1)/(2)[m^(2)+eta^(mu nu)(p_(mu)+(e)/(c)A_(mu))(p_(nu)+(e)/(c)A_(nu))].\mathscr{K}\left(p_{\alpha}, x^{\beta}\right)=\frac{1}{2}\left[m^{2}+\eta^{\mu \nu}\left(p_{\mu}+\frac{e}{c} A_{\mu}\right)\left(p_{\nu}+\frac{e}{c} A_{\nu}\right)\right] .K(pα,xβ)=12[m2+ημν(pμ+ecAμ)(pν+ecAν)].
The variational principle gives Hamilton's equations for the rates of change
d x α / d λ = K / p α d x α / d λ = K / p α dx^(alpha)//d lambda=delK//delp_(alpha)d x^{\alpha} / d \lambda=\partial \mathscr{K} / \partial p_{\alpha}dxα/dλ=K/pα
and
d p β / d λ = K / x β . d p β / d λ = K / x β . dp_(beta)//d lambda=-delK//delx^(beta).d p_{\beta} / d \lambda=-\partial \mathscr{K} / \partial x^{\beta} .dpβ/dλ=K/xβ.
From these equations, one discovers that K K K\mathscr{K}K itself must be a constant, independent of the time-like parameter λ λ lambda\lambdaλ. The value of this constant has to be imposed as an initial condition, K = 0 K = 0 K=0\mathscr{K}=0K=0 ("specification of particle mass"), thereafter maintained by the Hamiltonian equations themselves. This vanishing of K K K\mathscr{K}K in no way kills the partial derivatives,
H / p α and H / x β H / p α  and  H / x β delH//delp_(alpha)quad" and "quad-delH//delx^(beta)\partial \mathscr{H} / \partial p_{\alpha} \quad \text { and } \quad-\partial \mathscr{H} / \partial x^{\beta}H/pα and H/xβ
that enter Hamilton's equations for the rates of change,
d x α / d λ and d p β / d λ d x α / d λ  and  d p β / d λ dx^(alpha)//d lambdaquad" and "quad dp_(beta)//d lambdad x^{\alpha} / d \lambda \quad \text { and } \quad d p_{\beta} / d \lambdadxα/dλ and dpβ/dλ
Whether derived in the one formalism or the other, the equations of motion are equivalent, but the covariance shows more clearly in the formalism of the superHamiltonian, and similarly in general relativity.
Granted values of the "field coordinates" g i j ( x , y , z ) ( ( 3 ) y ) g i j ( x , y , z ) ( 3 ) y g_(ij)(x,y,z)(^((3))y)g_{i j}(x, y, z)\left({ }^{(3)} y\right)gij(x,y,z)((3)y) and field momenta π true i j ( x , y , z ) = δ S / δ g i j π true  i j ( x , y , z ) = δ S / δ g i j pi_("true ")^(ij)(x,y,z)=delta S//deltag_(ij)\pi_{\text {true }}^{i j}(x, y, z)=\delta S / \delta g_{i j}πtrue ij(x,y,z)=δS/δgij compatible with (21.12), one has what are called "compatible initial-value data on an initial spacelike hypersurface." One can proceed as described in § 21.8 § 21.8 §21.8\S 21.8§21.8 to integrate ahead in time step by step from one spacelike hypersurface to another and another, and construct the whole 4-geometry. Here one is dealing with what in mathematical terminology are hyperbolic differential equations that have the character of a wave equation.
In contrast, one deals with elliptic differential equations that have the character of a Poisson potential equation when one undertakes in the first place to construct the needed initial-value data ( $ 21.9 $ 21.9 $21.9\$ 21.9$21.9 ). In the analysis of these elliptic equations, it
Dynamic evolution of geometry
Another choice of what to fix at boundary hypersurface: conformal part of 3-geometry plus extrinsic time
Mach updated: mass-energy there governs inertia here
proves helpful to distinguish in the 3-geometry between (1) the part of the metric that determines relative lengths at a point, which is to say angles ("the conformal part of the metric") and (2) the common multiplicative factor that enters all the components of the g i j g i j g_(ij)g_{i j}gij at a point to determine the absolute scale of lengths at that point. This breakdown of the 3-geometry into two parts provides a particularly simple way to deal with two special initial-value problems known as the time-symmetric and time-antisymmetric initial-value problems ( $ 21.10 $ 21.10 $21.10\$ 21.10$21.10 ).
The ADM formalism is today in course of development as summarized in §21.11. In Wheeler's (1968a) "superspace" form, the ADM treatment takes the 3-geometry to be fixed on each of the bounding spacelike hypersurfaces. In contrast, York ( $ 21.11 $ 21.11 $21.11\$ 21.11$21.11 ) goes back to the original Hilbert action principle, and discovers what it takes to be fixed on each of the bounding spacelike hypersurfaces. The appropriate data turn out to be the "conformal part of the 3-geometry" plus something closely related to what Kuchař (1971a and 1972) calls the "extrinsic time." The contrast between Wheeler's approach and the Kuchař-York approach shows particularly clearly when one (1) deals with a flat spacetime manifold, (2) takes a flat spacelike section through this spacetime, and then (3) introduces a slight bump on this slice, of height ϵ ϵ epsilon\epsilonϵ. The 3-geometry intrinsic to this deformed slice differs from Euclidean geometry only to the second order in ϵ ϵ epsilon\epsilonϵ. Therefore to read back from the full 3-geometry to the time ("the forward advance of the bump") requires in this case an operation something like extracting a square root. In contrast, the Kuchař-York treatment deals with the "extrinsic curvature" of the slice, something proportional to the first power of ϵ ϵ epsilon\epsilonϵ, and therefore provides what is in some ways a more convenient measure of time [see especially Kuchař (1971) for the construction of "extrinsic time" for arbitrarily strong cylindrical gravitational waves; see also Box 30.1 on "time" as variously defined in "mixmaster cosmology"]. York shows that the time-variable is most conveniently identified with the variable "dynamically conjugate to the conformal factor in the 3 -geometry."
The initial-value problem of geometrodynamics can be formulated either in the language of Wheeler or in the language of Kuchař and York. In either formulation ( $ 21.9 $ 21.9 $21.9\$ 21.9$21.9 or § 21.11 § 21.11 §21.11\S 21.11§21.11 ) it throws light on what one ought properly today to understand by Mach's principle (§21.12). That principle meant to Mach that the "acceleration" dealt with in Newtonian mechanics could have a meaning only if it was acceleration with respect to the fixed stars or to something equally well-defined. It guided Einstein to general relativity. Today it is summarized in the principle that "mass-energy there governs inertia here," and is given mathematical expression in the initial-value equations.
The analysis of the initial-value problem connected past and future across a spacelike hypersurface. In contrast, one encounters a hypersurface that accommodates a timelike vector when one deals ( $ 21.13 $ 21.13 $21.13\$ 21.13$21.13 ) with the junction conditions between one solution of Einstein's field equation (say, the Friedmann geometry interior to a spherical cloud of dust of uniform density) and another (say, the Schwarzschild geometry exterior to this cloud of dust). Section 21.13, and the chapter, terminate with notes on gravitational shock waves and the characteristic initial-value problem (the statement of initial-value data on a light cone, for example).

§21.2. THE HILBERT ACTION PRINCIPLE AND THE PALATINI METHOD OF VARIATION

Five days before Einstein presented his geometrodynamic law in its final and now standard form, Hilbert, animated by Einstein's earlier work, independently discovered (1915a) how to formulate this law as the consequence of the simplest action principle of the form (21.2-21.3) that one can imagine:
(21.13) L geom = ( 1 / 16 π ) ( 4 ) R (21.13) L geom  = ( 1 / 16 π ) ( 4 ) R {:(21.13)L_("geom ")=(1//16 pi)^((4))R:}\begin{equation*} L_{\text {geom }}=(1 / 16 \pi)^{(4)} R \tag{21.13} \end{equation*}(21.13)Lgeom =(1/16π)(4)R
(Replace 1 / 16 π 1 / 16 π 1//16 pi1 / 16 \pi1/16π by c 3 / 16 π G c 3 / 16 π G c^(3)//16 pi Gc^{3} / 16 \pi Gc3/16πG when going from the present geometric units to conventional units; or divide by L 2 L 2 ℏ∼L^(**2)\hbar \sim L^{* 2}L2 to convert from dynamic phase, with the units of action, to actual phase of a wave function, with the units of radians). Here ( 4 ) R ( 4 ) R ^((4))R{ }^{(4)} R(4)R is the four-dimensional scalar curvature invariant, as spelled out in Box 8.4.
This action principle contains second derivatives of the metric coefficients. In contrast, the action principle for mechanics contains only first derivatives of the dynamic variables; and similarly only derivatives of the type A α / x β A α / x β delA_(alpha)//delx^(beta)\partial A_{\alpha} / \partial x^{\beta}Aα/xβ appear in the action principle for electrodynamics. Therefore one might also have expected only first derivatives, of the form g μ ν / x γ g μ ν / x γ delg_(mu nu)//delx^(gamma)\partial g_{\mu \nu} / \partial x^{\gamma}gμν/xγ, in the action principle here. However, no scalar invariant lets itself be constructed out of these first derivatives. Thus, to be an invariant, L geom L geom  L_("geom ")L_{\text {geom }}Lgeom  has to have a value independent of the choice of coordinate system. But in the neighborhood of a point, one can always so choose a coordinate system that all first derivatives of the g μ v g μ v g_(mu v)g_{\mu v}gμv vanish. Apart from a constant, there is no scalar invariant that can be built homogeneously out of the metric coefficients and their first derivatives.
When one turns from first derivatives to second derivatives, one has all twenty distinct components of the curvature tensor to work with. Expressed in a local inertial frame, these twenty components are arbitrary to the extent of the six parameters of a local Lorentz transformation. There are thus 20 6 = 14 20 6 = 14 20-6=1420-6=14206=14 independent local features of the curvature ("curvature invariants") that are coordinate-independent, any one of which one could imagine employing in the action principle. However, ( 4 ) R ( 4 ) R ^((4))R{ }^{(4)} R(4)R is the only one of these 14 quantities that is linear in the second derivatives of the metric coefficients. Any choice of invariant other than Hilbert's complicates the geometrodynamic law, and destroys the simple correspondence with the Newtonian theory of gravity (Chapter 17).
Hilbert originally conceived of the independently adjustable functions of x , y , z , t x , y , z , t x,y,z,tx, y, z, tx,y,z,t in the variational principle as being the ten distinct components of the metric tensor in contravariant representation, g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. Later Palatini (1919) discovered a simpler and more instructive listing of the independently adjustable functions: not the ten g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν alone, but the ten g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν plus the forty Γ μ ν α Γ μ ν α Gamma_(mu nu)^(alpha)\Gamma_{\mu \nu}^{\alpha}Γμνα of the affine connection.
To give up the standard formula for the connection Γ Γ Gamma\GammaΓ in terms of the metric g g ggg and let Γ Γ Gamma\GammaΓ "flap in the breeze" is not a new kind of enterprise in mathematical physics. Even in the simplest problem of mechanics, one can give up the standard formula for the momentum p p ppp in terms of a time-derivative of the coordinate x x xxx and also let
Variational principle the simplest route to Einstein's equation
Scalar curvature invariant the only natural choice
Idea of varying coordinate and momentum independently
Variation of connection is a tensor
p p ppp "flap in the breeze." Then x ( t ) x ( t ) x(t)x(t)x(t) and p ( t ) p ( t ) p(t)p(t)p(t) become two independently adjustable functions in a new variational principle,
(21.14) I = x , t x , t [ p ( t ) d x ( t ) d t H ( p ( t ) , x ( t ) , t ) ] d t = extremum. (21.14) I = x , t x , t p ( t ) d x ( t ) d t H ( p ( t ) , x ( t ) , t ) d t =  extremum.  {:(21.14)I=int_(x^('),t^('))^(x,t)[p(t)(dx(t))/(dt)-H(p(t),x(t),t)]dt=" extremum. ":}\begin{equation*} I=\int_{x^{\prime}, t^{\prime}}^{x, t}\left[p(t) \frac{d x(t)}{d t}-H(p(t), x(t), t)\right] d t=\text { extremum. } \tag{21.14} \end{equation*}(21.14)I=x,tx,t[p(t)dx(t)dtH(p(t),x(t),t)]dt= extremum. 
Happily, out of the extremization with respect to choice of the function p ( t ) p ( t ) p(t)p(t)p(t), one recovers the standard formula for the momentum in terms of the velocity. The extremization with respect to choice of the other function, x ( t ) x ( t ) x(t)x(t)x(t), gives the equation of motion just as does the more elementary variational analysis of Euler and Lagrange, where x ( t ) x ( t ) x(t)x(t)x(t) is the sole adjustable function. A further analysis of this equivalence between the two kinds of variational principles in particle mechanics appears in Box 21.1. In that box, one also sees the two kinds of variational principle as applied to electrodynamics.
To express the Hilbert variational principle in terms of the Γ μ ν λ Γ μ ν λ Gamma_(mu nu)^(lambda)\Gamma_{\mu \nu}^{\lambda}Γμνλ and g α β g α β g^(alpha beta)g^{\alpha \beta}gαβ regarded as the primordial functions of t , x , y , z t , x , y , z t,x,y,zt, x, y, zt,x,y,z, note that the Lagrangian density is
(21.15) L geom ( g ) 1 / 2 = ( 1 / 16 π ) ( 4 ) R ( g ) 1 / 2 = ( 1 / 16 π ) g α β R α β ( g ) 1 / 2 (21.15) L geom  ( g ) 1 / 2 = ( 1 / 16 π ) ( 4 ) R ( g ) 1 / 2 = ( 1 / 16 π ) g α β R α β ( g ) 1 / 2 {:(21.15)L_("geom ")(-g)^(1//2)=(1//16 pi)^((4))R(-g)^(1//2)=(1//16 pi)g^(alpha beta)R_(alpha beta)(-g)^(1//2):}\begin{equation*} L_{\text {geom }}(-g)^{1 / 2}=(1 / 16 \pi)^{(4)} R(-g)^{1 / 2}=(1 / 16 \pi) g^{\alpha \beta} R_{\alpha \beta}(-g)^{1 / 2} \tag{21.15} \end{equation*}(21.15)Lgeom (g)1/2=(1/16π)(4)R(g)1/2=(1/16π)gαβRαβ(g)1/2
Here, as in any spacetime manifold with an affine connection, one has (Chapter 14)
(21.16) R α β = R α λ β λ , (21.16) R α β = R α λ β λ , {:(21.16)R_(alpha beta)=R_(alpha lambda beta)^(lambda)",":}\begin{equation*} R_{\alpha \beta}=R_{\alpha \lambda \beta}^{\lambda}, \tag{21.16} \end{equation*}(21.16)Rαβ=Rαλβλ,
where
(21.17) R α μ β λ = Γ α β λ / x μ Γ α μ λ / x β + Γ σ μ λ Γ α β σ Γ σ β λ Γ α μ σ , (21.17) R α μ β λ = Γ α β λ / x μ Γ α μ λ / x β + Γ σ μ λ Γ α β σ Γ σ β λ Γ α μ σ , {:(21.17)R_(alpha mu beta)^(lambda)=delGamma_(alpha beta)^(lambda)//delx^(mu)-delGamma_(alpha mu)^(lambda)//delx^(beta)+Gamma_(sigma mu)^(lambda)Gamma_(alpha beta)^(sigma)-Gamma_(sigma beta)^(lambda)Gamma_(alpha mu)^(sigma)",":}\begin{equation*} R_{\alpha \mu \beta}^{\lambda}=\partial \Gamma_{\alpha \beta}^{\lambda} / \partial x^{\mu}-\partial \Gamma_{\alpha \mu}^{\lambda} / \partial x^{\beta}+\Gamma_{\sigma \mu}^{\lambda} \Gamma_{\alpha \beta}^{\sigma}-\Gamma_{\sigma \beta}^{\lambda} \Gamma_{\alpha \mu}^{\sigma}, \tag{21.17} \end{equation*}(21.17)Rαμβλ=Γαβλ/xμΓαμλ/xβ+ΓσμλΓαβσΓσβλΓαμσ,
and every Γ Γ Gamma\GammaΓ is given in advance (in a coordinate frame) as symmetric in its two lower indices. In order that the integral I I III of (21.2-21.3) should be an extremum, one requires that the variation in I I III caused by changes both in the g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν and in the Γ Γ Gamma\GammaΓ 's should vanish; thus,
(21.18) 0 = δ I = ( 1 / 16 π ) δ [ g α β R α β ( g ) 1 / 2 ] d 4 x + δ [ L field ( g ) 1 / 2 ] d 4 x (21.18) 0 = δ I = ( 1 / 16 π ) δ g α β R α β ( g ) 1 / 2 d 4 x + δ L field  ( g ) 1 / 2 d 4 x {:(21.18)0=delta I=(1//16 pi)int delta[g^(alpha beta)R_(alpha beta)(-g)^(1//2)]d^(4)x+int delta[L_("field ")(-g)^(1//2)]d^(4)x:}\begin{equation*} 0=\delta I=(1 / 16 \pi) \int \delta\left[g^{\alpha \beta} R_{\alpha \beta}(-g)^{1 / 2}\right] d^{4} x+\int \delta\left[L_{\text {field }}(-g)^{1 / 2}\right] d^{4} x \tag{21.18} \end{equation*}(21.18)0=δI=(1/16π)δ[gαβRαβ(g)1/2]d4x+δ[Lfield (g)1/2]d4x
Consider now the variations of the individual factors in the first and second integrals in (21.18). The variation of the first factor is trivial, δ g α β δ g α β deltag^(alpha beta)\delta g^{\alpha \beta}δgαβ. In the variation of the second factor, R α β R α β R_(alpha beta)R_{\alpha \beta}Rαβ, changes in the g α β g α β g^(alpha beta)g^{\alpha \beta}gαβ play no part; only changes in the Γ Γ Gamma\GammaΓ 's appear. Moreover, the variation δ Γ α β λ δ Γ α β λ deltaGamma_(alpha beta)^(lambda)\delta \Gamma_{\alpha \beta}^{\lambda}δΓαβλ is a tensor even though Γ α β λ Γ α β λ Gamma_(alpha beta)^(lambda)\Gamma_{\alpha \beta}^{\lambda}Γαβλ itself is not. Thus in the transformation formula
(21.19) Γ α ¯ β ¯ γ ¯ β ¯ = [ Γ σ τ λ x σ x α ¯ x τ x β ¯ + 2 x λ x α ¯ x β ¯ ] x γ ¯ x λ , (21.19) Γ α ¯ β ¯ γ ¯ β ¯ = Γ σ τ λ x σ x α ¯ x τ x β ¯ + 2 x λ x α ¯ x β ¯ x γ ¯ x λ , {:(21.19)Gamma_( bar(alpha) bar(beta))^( bar(gamma)_( bar(beta)))=[Gamma_(sigma tau)^(lambda)(delx^(sigma))/(delx^( bar(alpha)))(delx^(tau))/(delx^( bar(beta)))+(del^(2)x^(lambda))/(delx^( bar(alpha))delx^( bar(beta)))](delx^( bar(gamma)))/(delx^(lambda))",":}\begin{equation*} \Gamma_{\bar{\alpha} \bar{\beta}}^{\bar{\gamma}_{\bar{\beta}}}=\left[\Gamma_{\sigma \tau}^{\lambda} \frac{\partial x^{\sigma}}{\partial x^{\bar{\alpha}}} \frac{\partial x^{\tau}}{\partial x^{\bar{\beta}}}+\frac{\partial^{2} x^{\lambda}}{\partial x^{\bar{\alpha}} \partial x^{\bar{\beta}}}\right] \frac{\partial x^{\bar{\gamma}}}{\partial x^{\lambda}}, \tag{21.19} \end{equation*}(21.19)Γα¯β¯γ¯β¯=[Γστλxσxα¯xτxβ¯+2xλxα¯xβ¯]xγ¯xλ,
the last term destroys the tensor character of any set of Γ σ τ λ Γ σ τ λ Gamma_(sigma tau)^(lambda)\Gamma_{\sigma \tau}^{\lambda}Γστλ individually, but subtracts out in the difference δ Γ σ τ λ δ Γ σ τ λ deltaGamma_(sigma tau)^(lambda)\delta \Gamma_{\sigma \tau}^{\lambda}δΓστλ between two alternative sets of Γ Γ Gamma\GammaΓ 's. Note that the variation δ R λ α μ β δ R λ α μ β deltaR^(lambda)_(alpha mu beta)\delta R^{\lambda}{ }_{\alpha \mu \beta}δRλαμβ of the typical component of the curvature tensor consists of two terms of
(continued on page 500)

Box 21.1 RATE OF CHANGE OF ACTION WITH DYNAMIC COORDINATE ( = '"MOMENTUM') AND WITH TIME, AND THE DISPERSION RELATION ( = 'HAMILTONIAN") THAT CONNECTS THEM IN PARTICLE MECHANICS AND IN ELECTRODYNAMICS

A. PROLOG ON THE PARTICLE-MECHANICS ANALOG OF THE PALATINI METHOD

In particle mechanics, one considers the history x = x ( t ) x = x ( t ) x=x(t)x=x(t)x=x(t) to be adjustable between the end points ( x , t ) x , t (x^('),t^('))\left(x^{\prime}, t^{\prime}\right)(x,t) and ( x , t ) ( x , t ) (x,t)(x, t)(x,t) and varies it to extremize the integral I = I = I=I=I= L ( x , x ˙ , t ) d t L ( x , x ˙ , t ) d t int L(x,x^(˙),t)dt\int L(x, \dot{x}, t) d tL(x,x˙,t)dt taken between these two limits.
Expressed in terms of coordinates and momenta
(see Figure 21.1), the integral has the form
(1) I = [ p x ˙ H ( p , x , t ) ] d t (1) I = [ p x ˙ H ( p , x , t ) ] d t {:(1)I=int[px^(˙)-H(p","x","t)]dt:}\begin{equation*} I=\int[p \dot{x}-H(p, x, t)] d t \tag{1} \end{equation*}(1)I=[px˙H(p,x,t)]dt
where x ( t ) x ( t ) x(t)x(t)x(t) is again the function to be varied and p p ppp is only an abbreviation for a certain function of x x xxx and x ˙ x ˙ x^(˙)\dot{x}x˙; thus, p = L ( x , x ˙ , t ) / x ˙ p = L ( x , x ˙ , t ) / x ˙ p=del L(x,x^(˙),t)//delx^(˙)p=\partial L(x, \dot{x}, t) / \partial \dot{x}p=L(x,x˙,t)/x˙. Viewed in this way, the variation, δ p ( t ) δ p ( t ) delta p(t)\delta p(t)δp(t), of the momentum is governed by, and is only a reflection of, the variation δ x ( t ) δ x ( t ) delta x(t)\delta x(t)δx(t).

1. Momentum Treated as Independently Variable

There miraculously exists, however, quite another way to view the problem (see inset). One can regard x ( t ) x ( t ) x(t)x(t)x(t) and p ( t ) p ( t ) p(t)p(t)p(t) as two quite uncorrelated and independently adjustable functions. One abandons the formula p = L ( x , x ˙ , t ) / x ˙ p = L ( x , x ˙ , t ) / x ˙ p=del L(x,x^(˙),t)//delx^(˙)p=\partial L(x, \dot{x}, t) / \partial \dot{x}p=L(x,x˙,t)/x˙, only to recover it,
or the equivalent of it, from the new "independ-ent-coordinate-and-momentum version" of the variation principle.
The variation of (1), as defined and calculated in this new way, becomes
(2) δ I = p δ x | x , t x , t + x , t x , t [ ( x ˙ H p ) δ p + ( p ˙ H x ) δ x ] d t . (2) δ I = p δ x x , t x , t + x , t x , t x ˙ H p δ p + p ˙ H x δ x d t . {:(2)delta I=p delta x|_(x^('),t^('))^(x^(''),t^(''))+int_(x^('),t^('))^(x^(''),t^(''))[((x^(˙))-(del H)/(del p))delta p+(-(p^(˙))-(del H)/(del x))delta x]dt.:}\begin{equation*} \delta I=\left.p \delta x\right|_{x^{\prime}, t^{\prime}} ^{x^{\prime \prime}, t^{\prime \prime}}+\int_{x^{\prime}, t^{\prime}}^{x^{\prime \prime}, t^{\prime \prime}}\left[\left(\dot{x}-\frac{\partial H}{\partial p}\right) \delta p+\left(-\dot{p}-\frac{\partial H}{\partial x}\right) \delta x\right] d t . \tag{2} \end{equation*}(2)δI=pδx|x,tx,t+x,tx,t[(x˙Hp)δp+(p˙Hx)δx]dt.
Demand that the coefficient of δ p δ p delta p\delta pδp vanish and have the sought-for new version,
x ˙ = H ( p , x , t ) p x ˙ = H ( p , x , t ) p x^(˙)=(del H(p,x,t))/(del p)\dot{x}=\frac{\partial H(p, x, t)}{\partial p}x˙=H(p,x,t)p
of the old relation, p = L ( x , x ˙ , t ) / x ˙ p = L ( x , x ˙ , t ) / x ˙ p=del L(x,x^(˙),t)//delx^(˙)p=\partial L(x, \dot{x}, t) / \partial \dot{x}p=L(x,x˙,t)/x˙, between momentum and velocity. The vanishing of the coefficient of δ x δ x delta x\delta xδx gives the other Hamilton equation,
(3) p ˙ = H ( p , x , t ) x (3) p ˙ = H ( p , x , t ) x {:(3)p^(˙)=-(del H(p,x,t))/(del x):}\begin{equation*} \dot{p}=-\frac{\partial H(p, x, t)}{\partial x} \tag{3} \end{equation*}(3)p˙=H(p,x,t)x
equivalent in content to the original Lagrange equation of motion,
(4) d d t L x ˙ L x = 0 (4) d d t L x ˙ L x = 0 {:(4)(d)/(dt)(del L)/(del(x^(˙)))-(del L)/(del x)=0:}\begin{equation*} \frac{d}{d t} \frac{\partial L}{\partial \dot{x}}-\frac{\partial L}{\partial x}=0 \tag{4} \end{equation*}(4)ddtLx˙Lx=0
That p ( t ) p ( t ) p(t)p(t)p(t) in this double variable conception is-before the extremization!-a function of time quite separate from and independent of the function x ( t ) x ( t ) x(t)x(t)x(t) shows nowhere more clearly than in the circumstance that p ( t ) p ( t ) p(t)p(t)p(t) has no end point conditions imposed on it, whereas x x x^(')x^{\prime}x and x x x^('')x^{\prime \prime}x are specified. Thus not only is the shape of the history subject to adjustment in x , p , t x , p , t x,p,tx, p, tx,p,t space in the course of achieving the extremum, but even the end points are subject to being slid along the two indicated lines in the inset, like beads on a wire.

2. Action as Tool for Finding Dispersion Relation

Denote by S ( x , t ) S ( x , t ) S(x,t)S(x, t)S(x,t) the "action," or extremal value of I I III, for the classical history that starts with ( x , t ) x , t (x^('),t^('))\left(x^{\prime}, t^{\prime}\right)(x,t) and ends at ( x , t ) ( = ( x , t ) ( = (x,t)(=ℏ(x, t)(=\hbar(x,t)(= times phase of de Broglie wave). To change the end points to ( x + δ x , t ) ( x + δ x , t ) (x+delta x,t)(x+\delta x, t)(x+δx,t) makes the change in action
(5) δ S = p δ x (5) δ S = p δ x {:(5)delta S=p delta x:}\begin{equation*} \delta S=p \delta x \tag{5} \end{equation*}(5)δS=pδx
Thus momentum is "rate of change of action with dynamic coordinate."
To change the end point to
(6) ( x + δ x , t + δ t ) = ( [ x + x ˙ δ t ] + [ δ x x ˙ δ t ] , t + δ t ) (6) ( x + δ x , t + δ t ) = ( [ x + x ˙ δ t ] + [ δ x x ˙ δ t ] , t + δ t ) {:(6)(x+delta x","t+delta t)=([x+x^(˙)delta t]+[delta x-x^(˙)delta t]","t+delta t):}\begin{equation*} (x+\delta x, t+\delta t)=([x+\dot{x} \delta t]+[\delta x-\dot{x} \delta t], t+\delta t) \tag{6} \end{equation*}(6)(x+δx,t+δt)=([x+x˙δt]+[δxx˙δt],t+δt)
makes the change in action
(7) δ S = p [ δ x x ˙ δ t ] + L δ t = p δ x H δ t . (7) δ S = p [ δ x x ˙ δ t ] + L δ t = p δ x H δ t . {:(7)delta S=p[delta x-x^(˙)delta t]+L delta t=p delta x-H delta t.:}\begin{equation*} \delta S=p[\delta x-\dot{x} \delta t]+L \delta t=p \delta x-H \delta t . \tag{7} \end{equation*}(7)δS=p[δxx˙δt]+Lδt=pδxHδt.
Thus the Hamiltonian is the negative of "the rate of change of action with time."
In terms of the Hamiltonian H = H ( p , x ) H = H ( p , x ) H=H(p,x)H=H(p, x)H=H(p,x), the "dispersion relation" for de Broglie waves becomes
(8) S t = H ( S x , x ) (8) S t = H S x , x {:(8)-(del S)/(del t)=H((del S)/(del x),x):}\begin{equation*} -\frac{\partial S}{\partial t}=H\left(\frac{\partial S}{\partial x}, x\right) \tag{8} \end{equation*}(8)St=H(Sx,x)
In the derivation of this dispersion relation, one can profitably short-cut all talk of p ( t ) p ( t ) p(t)p(t)p(t) and x ( t ) x ( t ) x(t)x(t)x(t) as independently variable quantities, and derive the result in hardly
more than one step from the definition I = L ( x , x ˙ , t ) d t I = L ( x , x ˙ , t ) d t I=int L(x,x^(˙),t)dtI=\int L(x, \dot{x}, t) d tI=L(x,x˙,t)dt. Similarly in electrodynamics.
The remainder of this box best follows a first perusal of Chapter 21.

B. ANALOG OF THE PALATINI METHOD IN ELECTRODYNAMICS

In source-free electrodynamics, one considers as given two spacelike hypersurfaces S S S^(')S^{\prime}S and S S S^('')S^{\prime \prime}S, and the magnetic fields-as-a-function-of-position in each, B B B^(')B^{\prime}B and B B B^('')B^{\prime \prime}B (this second field will later be written without the " superscript to simplify the notation). To be varied is an integral extended over the region of spacetime between the two hypersurfaces,
(9) I Maxwell E Maxwell d 4 x = 1 16 π F μ ν F μ ν ( g ) 1 / 2 d 4 x (9) I Maxwell  E Maxwell  d 4 x = 1 16 π F μ ν F μ ν ( g ) 1 / 2 d 4 x {:(9)I_("Maxwell ")-=intE_("Maxwell ")d^(4)x=-(1)/(16 pi)intF^(mu nu)F_(mu nu)(-g)^(1//2)d^(4)x:}\begin{equation*} I_{\text {Maxwell }} \equiv \int \mathcal{E}_{\text {Maxwell }} d^{4} x=-\frac{1}{16 \pi} \int F^{\mu \nu} F_{\mu \nu}(-g)^{1 / 2} d^{4} x \tag{9} \end{equation*}(9)IMaxwell EMaxwell d4x=116πFμνFμν(g)1/2d4x

1. Variation of Field on Hypersurface and Variation of Location of Hypersurface are Cleanly Separated Concepts in Electromagnetism

The electromagnetic field F F F\boldsymbol{F}F is the physically relevant quantity in electromagnetism (compare the 3 -geometry in geometrodynamics). By contrast, the 4 -potential A A A\boldsymbol{A}A has no direct physical significance. A change of gauge in the potentials,
A μ = A μ new + λ / x μ A μ = A μ new  + λ / x μ A_(mu)=A_(mu_("new "))+del lambda//delx^(mu)A_{\mu}=A_{\mu_{\text {new }}}+\partial \lambda / \partial x^{\mu}Aμ=Aμnew +λ/xμ
leaves unchanged the field components
F μ ν = A ν / x μ A μ / x ν F μ ν = A ν / x μ A μ / x ν F_(mu nu)=delA_(nu)//delx^(mu)-delA_(mu)//delx^(nu)F_{\mu \nu}=\partial A_{\nu} / \partial x^{\mu}-\partial A_{\mu} / \partial x^{\nu}Fμν=Aν/xμAμ/xν
(compare the coordinate transformation that changes the g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν while leaving unchanged the ( 3 ) y ) ( 3 ) y {:^((3))y)\left.{ }^{(3)} \mathscr{y}\right)(3)y). The variation of the fields within the body of the sandwich is nevertheless expressed most conveniently in terms of the effect of changes δ A μ δ A μ deltaA_(mu)\delta A_{\mu}δAμ in the potentials.
One also wants to see how the action integral is influenced by changes in the location of the upper spacelike hypersurface ("many-fingered time"). Think of the point of the hypersurface that is presently endowed with coordinates x , y , z , t ( x , y , z ) x , y , z , t ( x , y , z ) x,y,z,t(x,y,z)x, y, z, t(x, y, z)x,y,z,t(x,y,z) as being displaced to x , y , z , t + δ t ( x , y , z ) x , y , z , t + δ t ( x , y , z ) x,y,z,t+delta t(x,y,z)x, y, z, t+\delta t(x, y, z)x,y,z,t+δt(x,y,z). Now renounce this use of a privileged coordinate system. Describe the displacement of the simultaneity in terms of a 4 -vector δ n δ n delta n\delta \boldsymbol{n}δn (not a unit 4 -vector) normal to the hypersurface Σ Σ Sigma\SigmaΣ. The element of 4 -volume δ Ω δ Ω delta Omega\delta \OmegaδΩ included between the original upper face of the sandwich and the new upper face, that had in the privileged coordinate system the form ( g ) 1 / 2 δ t ( x , y , z ) d 3 x ( g ) 1 / 2 δ t ( x , y , z ) d 3 x (-g)^(1//2)delta t(x,y,z)d^(3)x(-g)^{1 / 2} \delta t(x, y, z) d^{3} x(g)1/2δt(x,y,z)d3x, in the notation of Chapter 20 becomes
(10) δ Ω = δ n μ d 3 Σ μ = ( δ n d 3 Σ ) (10) δ Ω = δ n μ d 3 Σ μ = δ n d 3 Σ {:(10)delta Omega=deltan^(mu)d^(3)Sigma_(mu)=(delta n*d^(3)Sigma):}\begin{equation*} \delta \Omega=\delta n^{\mu} d^{3} \Sigma_{\mu}=\left(\delta \boldsymbol{n} \cdot d^{3} \boldsymbol{\Sigma}\right) \tag{10} \end{equation*}(10)δΩ=δnμd3Σμ=(δnd3Σ)
where the element of surface d 3 Σ μ d 3 Σ μ d^(3)Sigma_(mu)d^{3} \Sigma_{\mu}d3Σμ already includes the previously listed factor ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2.
Box 21.1 (continued)
Counting together the influence of changes in the field values on the upper hypersurface and changes in the location of that hypersurface, one has
δ S = δ I extremal = ( 1 / 16 π ) upper Σ F μ ν F μ ν ( δ n d 3 Σ ) (11) + ( 1 / 4 π ) upper Σ F μ ν Δ A μ replace by d 3 Σ ν its equivalent δ S = δ I extremal  = ( 1 / 16 π ) upper  Σ F μ ν F μ ν δ n d 3 Σ (11) + ( 1 / 4 π ) upper  Σ F μ ν Δ A μ replace by  d 3 Σ ν  its equivalent  {:[delta S=deltaI_("extremal ")=-(1//16 pi)int_("upper "Sigma)F^(mu nu)F_(mu nu)(delta n*d^(3)Sigma)],[(11)+(1//4pi)int_("upper "Sigma)F^(mu nu)ubrace(DeltaA_(mu)ubrace)_("replace by ")d^(3)Sigma_(nu)],[" its equivalent "]:}\begin{align*} & \delta S=\delta I_{\text {extremal }}=-(1 / 16 \pi) \int_{\text {upper } \Sigma} F^{\mu \nu} F_{\mu \nu}\left(\delta \boldsymbol{n} \cdot d^{3} \boldsymbol{\Sigma}\right) \\ & +(1 / 4 \pi) \int_{\text {upper } \Sigma} F^{\mu \nu} \underbrace{\Delta A_{\mu}}_{\text {replace by }} d^{3} \Sigma_{\nu} \tag{11}\\ & \text { its equivalent } \end{align*}δS=δIextremal =(1/16π)upper ΣFμνFμν(δnd3Σ)(11)+(1/4π)upper ΣFμνΔAμreplace by d3Σν its equivalent 
Simplify this expression by arranging the coordinates so that the hypersurface shall be a hypersurface of constant t t ttt, and so that lines of constant x , y , z x , y , z x,y,zx, y, zx,y,z shall be normal to this hypersurface. Then it follows that the element of volume on that hypersurface contains a single nonvanishing component, d 3 Σ 0 = ( g ) 1 / 2 d 3 x d 3 Σ 0 = ( g ) 1 / 2 d 3 x d^(3)Sigma_(0)=(-g)^(1//2)d^(3)xd^{3} \Sigma_{0}=(-g)^{1 / 2} d^{3} xd3Σ0=(g)1/2d3x. The antisymmetry of the field quantity F 0 v F 0 v F^(0v)F^{0 v}F0v in its two indices requires that ν ν nu\nuν be a spacelike label, i = 1 , 2 , 3 i = 1 , 2 , 3 i=1,2,3i=1,2,3i=1,2,3. The variation of the action becomes
δ S = [ ( g ) 1 / 2 F i 0 4 π δ A i add and subtract { ( g ) 1 / 2 F i 0 4 π A i ; 0 4 π E Maxwell } δ t ] d 3 x . δ S = [ ( g ) 1 / 2 F i 0 4 π δ A i  add and subtract  ( g ) 1 / 2 F i 0 4 π A i ; 0 4 π E Maxwell  } δ t ] d 3 x . delta S=int[((-g)^(1//2)F^(i0))/(4pi)deltaA_(i)- obrace(" add and subtract "{((-g)^(1//2)F^(i0))/(4pi)A_(i;0))^(4pi)-E_("Maxwell ")}delta t]d^(3)x.\delta S=\int[\frac{(-g)^{1 / 2} F^{i 0}}{4 \pi} \delta A_{i}-\overbrace{\text { add and subtract }\left\{\frac{(-g)^{1 / 2} F^{i 0}}{4 \pi} A_{i ; 0}\right.}^{4 \pi}-\mathcal{E}_{\text {Maxwell }}\} \delta t] d^{3} x .δS=[(g)1/2Fi04πδAi add and subtract {(g)1/2Fi04πAi;04πEMaxwell }δt]d3x.

2. Meaning of Field "Momentum" in Electrodynamics

Identify this expression with the quantity
(13) δ S = π E M i δ A i d 3 x K δ Ω , (13) δ S = π E M i δ A i d 3 x K δ Ω , {:(13)delta S=intpi_(EM)^(i)deltaA_(i)d^(3)x-intKdelta Omega",":}\begin{equation*} \delta S=\int \pi_{E M}^{i} \delta A_{i} d^{3} x-\int \mathscr{K} \delta \Omega, \tag{13} \end{equation*}(13)δS=πEMiδAid3xKδΩ,
where
(14) π E M i = δ S δ A i = ( "density of electromagnetic momentum dynamically canon- ically conjugate to A i " ) = ( g ) 1 / 2 F i 0 4 π = E i 4 π (14) π E M i = δ S δ A i =  "density of electromagnetic   momentum dynamically canon-   ically conjugate to  A i  "  = ( g ) 1 / 2 F i 0 4 π = E i 4 π {:(14)pi_(EM)^(i)=(delta S)/(deltaA_(i))=([" "density of electromagnetic "],[" momentum dynamically canon- "],[" ically conjugate to "A_(i)" " "])=((-g)^(1//2)F^(i0))/(4pi)=-(E^(i))/(4pi):}\pi_{E M}^{i}=\frac{\delta S}{\delta A_{i}}=\left(\begin{array}{l} \text { "density of electromagnetic } \tag{14}\\ \text { momentum dynamically canon- } \\ \text { ically conjugate to } A_{i} \text { " } \end{array}\right)=\frac{(-g)^{1 / 2} F^{i 0}}{4 \pi}=-\frac{\mathcal{E}^{i}}{4 \pi}(14)πEMi=δSδAi=( "density of electromagnetic  momentum dynamically canon-  ically conjugate to Ai " )=(g)1/2Fi04π=Ei4π
is a simple multiple of the electric field and where
(15) K = δ S δ Ω = ( "density of electromagnetic Hamiltonian" ) = ( 1 / 16 π ) [ F μ ν F μ ν + 4 F i 0 ( A i ; 0 A 0 ; i ) ] = ( 1 / 8 π ) ( E 2 + B 2 ) . (15) K = δ S δ Ω =  "density of   electromagnetic   Hamiltonian"  = ( 1 / 16 π ) F μ ν F μ ν + 4 F i 0 A i ; 0 A 0 ; i = ( 1 / 8 π ) E 2 + B 2 . {:[(15)K=-(delta S)/(delta Omega)=([" "density of "],[" electromagnetic "],[" Hamiltonian" "])=(1//16 pi)[F^(mu nu)F_(mu nu)+4F^(i0)(A_(i;0)-A_(0;i))]],[=(1//8pi)(E^(2)+B^(2)).]:}\begin{align*} \mathscr{K}=-\frac{\delta S}{\delta \Omega}=\left(\begin{array}{l} \text { "density of } \\ \text { electromagnetic } \\ \text { Hamiltonian" } \end{array}\right) & =(1 / 16 \pi)\left[F^{\mu \nu} F_{\mu \nu}+4 F^{i 0}\left(A_{i ; 0}-A_{0 ; i}\right)\right] \tag{15}\\ & =(1 / 8 \pi)\left(\boldsymbol{E}^{2}+\boldsymbol{B}^{2}\right) . \end{align*}(15)K=δSδΩ=( "density of  electromagnetic  Hamiltonian" )=(1/16π)[FμνFμν+4Fi0(Ai;0A0;i)]=(1/8π)(E2+B2).
The concept of dynamic Hamiltonian density agrees with the usual concept of density of electromagnetic energy, despite the very different context in which the two quantities are derived and used. However, the canonical momentum π E M i π E M i pi_(EM)^(i)\pi_{E M}^{i}πEMi has nothing directly whatsoever to do with the density of electromagnetic momentum as defined, for example, by the Poynting vector, despite the confusing similarity in the standard names for the two quantities. Note that there is no term δ A 0 δ A 0 deltaA_(0)\delta A_{0}δA0 in (13); that is, π E M 0 0 π E M 0 0 pi_(EM)^(0)-=0\pi_{E M}^{0} \equiv 0πEM00.

3. Bubble Differentiation

The "bubble differentiation" with respect to "many-fingered time" that appears in (15) was first introduced by Tomonaga (1946). One thinks of a spacelike hypersurface Σ 1 Σ 1 Sigma_(1)\Sigma_{1}Σ1, a magnetic field B B B\boldsymbol{B}B defined as a function of position on this hypersurface (by an observer on a world line normal to this hypersurface), and a prescription S S SSS that carries one from this information to a single number, the action. (Divided by \hbar, this action gives the phase of the "wave function" or "probability amplitude" for the occurrence of this particular distribution of field values over this particular hypersurface.) One goes to a second hypersurface Σ 2 Σ 2 Sigma_(2)\Sigma_{2}Σ2 (see inset),
which is identical with Σ 1 Σ 1 Sigma_(1)\Sigma_{1}Σ1, except in the immediate vicinity of a given point. Take a distribution of field values over Σ 2 Σ 2 Sigma_(2)\Sigma_{2}Σ2 that is identical with the original distribution over Σ 1 Σ 1 Sigma_(1)\Sigma_{1}Σ1, "identity of location" being defined by means of the normal. Evaluate the difference, δ S δ S delta S\delta SδS, in the value of the dynamic phase or action in the two cases. Divide this difference by the amount of proper 4 -volume δ Ω = δ Ω = delta Omega=\delta \Omega=δΩ= ( δ n d 3 Σ ) δ n d 3 Σ int(delta n*d^(3)Sigma)\int\left(\delta \boldsymbol{n} \cdot d^{3} \boldsymbol{\Sigma}\right)(δnd3Σ) contained in the "bubble" between the two hypersurfaces. Take the quotient, evaluate it in the limit in which the size of the bubble goes to zero, and in this way get the "bubble-time derivative," δ S / δ Ω δ S / δ Ω delta S//delta Omega\delta S / \delta \OmegaδS/δΩ, of the action.

Box 21.1 (continued)

What does it mean to say that the action, S S SSS, besides depending on the hypersurface, Σ Σ Sigma\SigmaΣ, depends also on the distribution of the magnetic field, B B BBB, over that hypersurface? The action depends on the physical quantity, B = × A B = × A B=grad xx A\boldsymbol{B}=\boldsymbol{\nabla} \times \boldsymbol{A}B=×A, not on the prephysical quantity, A A A\boldsymbol{A}A. Thus a change in gauge δ A i = λ / x i δ A i = λ / x i deltaA_(i)=del lambda//delx^(i)\delta A_{i}=\partial \lambda / \partial x^{i}δAi=λ/xi, cannot make any change in S S SSS. On the other hand, the calculated value of the change in S S SSS for this alteration in A A A\boldsymbol{A}A is
δ ( action ) = δ S = δ S δ A i δ A i d 3 x (16) = δ S δ A i λ x i d 3 x = ( δ S δ A i ) , i λ ( x , y , z ) d 3 x δ (  action  ) = δ S = δ S δ A i δ A i d 3 x (16) = δ S δ A i λ x i d 3 x = δ S δ A i , i λ ( x , y , z ) d 3 x {:[delta(" action ")=delta S=int(delta S)/(deltaA_(i))deltaA_(i)d^(3)x],[(16)=int(delta S)/(deltaA_(i))(del lambda)/(delx^(i))d^(3)x=-int((delta S)/(deltaA_(i)))_(,i)lambda(x","y","z)d^(3)x]:}\begin{align*} \delta(\text { action }) & =\delta S=\int \frac{\delta S}{\delta A_{i}} \delta A_{i} d^{3} x \\ & =\int \frac{\delta S}{\delta A_{i}} \frac{\partial \lambda}{\partial x^{i}} d^{3} x=-\int\left(\frac{\delta S}{\delta A_{i}}\right)_{, i} \lambda(x, y, z) d^{3} x \tag{16} \end{align*}δ( action )=δS=δSδAiδAid3x(16)=δSδAiλxid3x=(δSδAi),iλ(x,y,z)d3x
In order that there shall be no dependence of action on gauge, it follows that this expression must vanish for arbitrary λ ( x , y , z ) λ ( x , y , z ) lambda(x,y,z)\lambda(x, y, z)λ(x,y,z), a result only possible if S ( Σ , B ) = S ( Σ , B ) = S(Sigma,B)=S(\Sigma, \boldsymbol{B})=S(Σ,B)= S S SSS (hypersurface, field on hypersurface) satisfies the identity
(17) ( δ S δ A i ) , i = π E M , i i = ( 1 / 4 π ) E , i i = 0 (17) δ S δ A i , i = π E M , i i = ( 1 / 4 π ) E , i i = 0 {:(17)((delta S)/(deltaA_(i)))_(,i)=pi_(EM,i)^(i)=-(1//4pi)E_(,i)^(i)=0:}\begin{equation*} \left(\frac{\delta S}{\delta A_{i}}\right)_{, i}=\pi_{E M, i}^{i}=-(1 / 4 \pi) \mathcal{E}_{, i}^{i}=0 \tag{17} \end{equation*}(17)(δSδAi),i=πEM,ii=(1/4π)E,ii=0

4. Hamilton-Jacobi "Propagation Law" for Electrodynamics

The "dispersion relation" or "Hamilton-Jacobi equation" for electromagnetism relates (1) the changes of the "dynamic phase" or "action" brought about by alterations in the dynamic variables A i A i A_(i)A_{i}Ai (the generalization of the x x xxx of particle dynamics) with (2) the changes brought about by alterations in many-fingered time (the generalization of the single time t t ttt of particle dynamics); thus (15) translates into
(18) δ S δ Ω = ( 4 π ) 2 8 π ( δ S δ A ) 2 + 1 ( 8 π ) ( × A ) 2 (18) δ S δ Ω = ( 4 π ) 2 8 π δ S δ A 2 + 1 ( 8 π ) ( × A ) 2 {:(18)-(delta S)/(delta Omega)=((4pi)^(2))/(8pi)((delta S)/(delta A))^(2)+(1)/((8pi))(grad xx A)^(2):}\begin{equation*} -\frac{\delta S}{\delta \Omega}=\frac{(4 \pi)^{2}}{8 \pi}\left(\frac{\delta S}{\delta \boldsymbol{A}}\right)^{2}+\frac{1}{(8 \pi)}(\nabla \times \boldsymbol{A})^{2} \tag{18} \end{equation*}(18)δSδΩ=(4π)28π(δSδA)2+1(8π)(×A)2

C. DISPERSION RELATIONS FOR GEOMETRODYNAMICS AND ELECTRODYNAMICS COMPARED AND CONTRASTED

Geometrodynamics possesses a direct analog of equation (17) ("action depends on no information carried by the vector potential A A AAA except the magnetic field B = × A B = × A B=grad xxA^('')\boldsymbol{B}=\boldsymbol{\nabla} \times \boldsymbol{A}^{\prime \prime}B=×A ), in an equation that says the action depends on no information carried by the metric g i j g i j g_(ij)g_{i j}gij on the "upper face of the sandwich" except the 3 -geometry there, ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y. It also possesses a direct analog of equation (18) ("dynamic equation for the propagation of the action") with this one difference: in electrodynamics the field variable B B B\boldsymbol{B}B and the many-fingered time are distinct in character, whereas in geometrodynamics the "field" and the "many-fingered time" can be regarded as two aspects of one and the same ( 3 ) ξ ( 3 ) ξ ^((3))xi{ }^{(3)} \xi(3)ξ :

D. ACTION PRINCIPLE AND DISPERSION RELATION ARE ROOTED IN THE QUANTUM PRINCIPLE; FEYNMAN'S PRINCIPLE OF THE DEMOCRATIC EQUALITY OF ALL HISTORIES

For more on action principles in physics, see for example Mercier (1953), Lanczos (1970), and Yourgrau and Mandelstam (1968).
Newton (1687) in the first page of the preface to the first edition of his Principia notes that "The description of right lines..., upon which geometry is founded, belongs to mechanics. Geometry does not teach us to draw these lines, but requires them to be drawn."
Newton's remark is also a question. Mechanics moves a particle along a straight line, but what is the machinery by which mechanics accomplishes this miracle? The quantum principle gives the answer. The particle moves along the straight line only by not moving along the straight line. In effect it "feels out" every conceivable world line that leads from the start, ( x , t ) x , t (x^('),t^('))\left(x^{\prime}, t^{\prime}\right)(x,t), to the point of detection, ( x , t ) x , t (x^(''),t^(''))\left(x^{\prime \prime}, t^{\prime \prime}\right)(x,t), "compares" one with another, and takes the extremal world line. How does it accomplish this miracle?
The particle is governed by a "probability amplitude to transit from ( x , t x , t x^('),t^(')x^{\prime}, t^{\prime}x,t ) to ( x , t ) x , t (x^(''),t^(''))\left(x^{\prime \prime}, t^{\prime \prime}\right)(x,t)." This amplitude or "propagator," x , t x , t x , t x , t (:x^(''),t^('')∣x^('),t^('):)\left\langle x^{\prime \prime}, t^{\prime \prime} \mid x^{\prime}, t^{\prime}\right\ranglex,tx,t, is the democratic sum with equal weight of contributions from every world line that leads from start to finish; thus,
(15) x , t x , t = N e i I H / od x . (15) x , t x , t = N e i I H /  od  x {:(15)(:x^(''),t^('')∣x^('),t^('):)=N inte^(iI_(H^('))//ℏ)" od "x". ":}\begin{equation*} \left\langle x^{\prime \prime}, t^{\prime \prime} \mid x^{\prime}, t^{\prime}\right\rangle=N \int e^{i I_{H^{\prime}} / \hbar} \text { od } x \text {. } \tag{15} \end{equation*}(15)x,tx,t=NeiIH/ od x
Here N N NNN is a normalization factor, the same for all histories.
D x D x Dx\mathscr{D} xDx is the "volume element" for the sum over histories. For a "skeleton history" defined by giving x n x n x_(n)x_{n}xn at t n = t 0 + n Δ t t n = t 0 + n Δ t t_(n)=t_(0)+n Delta tt_{n}=t_{0}+n \Delta ttn=t0+nΔt, one has D x D x Dx\mathscr{D} xDx equal, up to a multiplicative constant, to d x 1 d x 2 d x N d x 1 d x 2 d x N dx_(1)dx_(2)dots dx_(N)d x_{1} d x_{2} \ldots d x_{N}dx1dx2dxN. When the history is defined by the Fourier coefficients in such an expression as
(16) x ( t ) = x ( t t ) + x ( t t ) ( t t ) + n a n sin n π ( t t ) ( t t ) (16) x ( t ) = x t t + x t t t t + n a n sin n π t t t t {:(16)x(t)=(x^(')(t^('')-t)+x^('')(t-t^(')))/((t^('')-t^(')))+sum_(n)a_(n)sin n pi((t-t^(')))/((t^('')-t^('))):}\begin{equation*} x(t)=\frac{x^{\prime}\left(t^{\prime \prime}-t\right)+x^{\prime \prime}\left(t-t^{\prime}\right)}{\left(t^{\prime \prime}-t^{\prime}\right)}+\sum_{n} a_{n} \sin n \pi \frac{\left(t-t^{\prime}\right)}{\left(t^{\prime \prime}-t^{\prime}\right)} \tag{16} \end{equation*}(16)x(t)=x(tt)+x(tt)(tt)+nansinnπ(tt)(tt)
the volume element, again up to a multiplicative factor, is d a 1 d a 2 d a 1 d a 2 da_(1)da_(2)dotsd a_{1} d a_{2} \ldotsda1da2.
Destructive interference in effect wipes out the contribution to the transition probability from histories that differ significantly from the "extremal history" or "classical history." Histories that are near that extremal history, on the other hand, contribute constructively, and for a simple reason: a small departure of the first order from the classical history brings about a change in phase which is only of the second order in the departure.
In this elementary example, one sees illustrated why it is that extremal principles play such a large part in classical dynamics. They remind one that all classical physics rests on a foundation of quantum physics. The central ideas are (1) the principle

Box 21.1 (continued)

of superposition of probability amplitudes, (2) constructive and destructive interference, (3) the "democracy of all histories," and (4) the probability amplitude associated with a history H H HHH is e i I H / e i I H / e^(iI_(H)//ℏ)e^{i I_{H} / \hbar}eiIH/, apart from a normalizing factor that is a multiplicative constant.
For more on the democracy of histories and the sum over histories see Feynman (1942, 1948, 1949, 1951, and 1955), and the book of Feynman and Hibbs (1965); also Hibbs (1951), Morette (1951), Choquard (1955), Polkinghorne (1955), Fujiwara (1962), and the survey and literature references in Kursunoglu (1962); also reports of Dempster (1963) and Symanzik (1963). This outlook has been applied by many workers to discuss the quantum formulation of geometrodynamics, the first being Misner (1957) and one of the latest being Faddeev (1971).
the form δ Γ λ α β , μ δ Γ λ α β , μ deltaGamma^(lambda_(alpha beta,mu))\delta \Gamma^{\lambda_{\alpha \beta, \mu}}δΓλαβ,μ and four terms of the form Γ δ Γ Γ δ Γ Gamma delta Gamma\Gamma \delta \GammaΓδΓ (indices being dropped for simplicity). One coordinate system is as good as another in dealing with a tensor. Therefore pick a coordinate system in which all the Γ Γ Gamma\GammaΓ 's vanish at the point under study. The terms Γ δ Γ Γ δ Γ Gamma delta Gamma\Gamma \delta \GammaΓδΓ drop out. In this coordinate system, the variation of the curvature is expressed in terms of first derivatives of quantities like δ Γ α β λ δ Γ α β λ deltaGamma_(alpha beta)^(lambda)\delta \Gamma_{\alpha \beta}^{\lambda}δΓαβλ. One then need only replace the ordinary derivatives by covariant derivatives to obtain a formula correct in any coordinate system,
(21.20) δ R α μ β λ = δ Γ α β ; μ λ δ Γ α μ ; β λ (21.20) δ R α μ β λ = δ Γ α β ; μ λ δ Γ α μ ; β λ {:(21.20)deltaR_(alpha mu beta)^(lambda)=deltaGamma_(alpha beta;mu)^(lambda)-deltaGamma_(alpha mu;beta)^(lambda):}\begin{equation*} \delta R_{\alpha \mu \beta}^{\lambda}=\delta \Gamma_{\alpha \beta ; \mu}^{\lambda}-\delta \Gamma_{\alpha \mu ; \beta}^{\lambda} \tag{21.20} \end{equation*}(21.20)δRαμβλ=δΓαβ;μλδΓαμ;βλ
along with its contraction,
(21.21) δ R α β = δ Γ α β ; λ λ δ Γ α λ ; β λ (21.21) δ R α β = δ Γ α β ; λ λ δ Γ α λ ; β λ {:(21.21)deltaR_(alpha beta)=deltaGamma_(alpha beta;lambda)^(lambda)-deltaGamma_(alpha lambda;beta)^(lambda):}\begin{equation*} \delta R_{\alpha \beta}=\delta \Gamma_{\alpha \beta ; \lambda}^{\lambda}-\delta \Gamma_{\alpha \lambda ; \beta}^{\lambda} \tag{21.21} \end{equation*}(21.21)δRαβ=δΓαβ;λλδΓαλ;βλ
The third factor that appears in the variation principle is ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2. Its variation (exercise 21.1) is
(21.22) δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν (21.22) δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν {:(21.22)delta(-g)^(1//2)=-(1)/(2)(-g)^(1//2)g_(mu nu)deltag^(mu nu):}\begin{equation*} \delta(-g)^{1 / 2}=-\frac{1}{2}(-g)^{1 / 2} g_{\mu \nu} \delta g^{\mu \nu} \tag{21.22} \end{equation*}(21.22)δ(g)1/2=12(g)1/2gμνδgμν
The other integrand, the Lagrange density L field L field  L_("field ")L_{\text {field }}Lfield , will depend on the fields present and their derivatives, but will be assumed to contain the metric only as g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν itself, never in the form of any derivatives of g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν.
In order for an extremum to exist, the following expression has to vanish:
( 1 / 16 π ) [ ( R α β 1 2 g α β R ) δ g α β + g α β ( δ Γ α β ; λ λ δ Γ α λ ; β λ ) ] ( g ) 1 / 2 d 4 x (21.23) + ( δ L field δ g α β 1 2 g α β L field ) δ g α β ( g ) 1 / 2 d 4 x = 0 ( 1 / 16 π ) R α β 1 2 g α β R δ g α β + g α β δ Γ α β ; λ λ δ Γ α λ ; β λ ( g ) 1 / 2 d 4 x (21.23) + δ L field δ g α β 1 2 g α β L field δ g α β ( g ) 1 / 2 d 4 x = 0 {:[(1//16 pi)int[(R_(alpha beta)-(1)/(2)g_(alpha beta)R)deltag^(alpha beta)+g^(alpha beta)(deltaGamma_(alpha beta;lambda)^(lambda)-deltaGamma_(alpha lambda;beta)^(lambda))](-g)^(1//2)d^(4)x],[(21.23)+int((deltaL_(field))/(deltag^(alpha beta))-(1)/(2)g_(alpha beta)L_(field))deltag^(alpha beta)(-g)^(1//2)d^(4)x=0]:}\begin{gather*} (1 / 16 \pi) \int\left[\left(R_{\alpha \beta}-\frac{1}{2} g_{\alpha \beta} R\right) \delta g^{\alpha \beta}+g^{\alpha \beta}\left(\delta \Gamma_{\alpha \beta ; \lambda}^{\lambda}-\delta \Gamma_{\alpha \lambda ; \beta}^{\lambda}\right)\right](-g)^{1 / 2} d^{4} x \\ +\int\left(\frac{\delta L_{\mathrm{field}}}{\delta g^{\alpha \beta}}-\frac{1}{2} g_{\alpha \beta} L_{\mathrm{field}}\right) \delta g^{\alpha \beta}(-g)^{1 / 2} d^{4} x=0 \tag{21.23} \end{gather*}(1/16π)[(Rαβ12gαβR)δgαβ+gαβ(δΓαβ;λλδΓαλ;βλ)](g)1/2d4x(21.23)+(δLfieldδgαβ12gαβLfield)δgαβ(g)1/2d4x=0
Focus attention on the term in (21.23) that contains the variations of Γ Γ Gamma\GammaΓ,
( 1 / 16 π ) g α β ( δ Γ α β ; λ λ δ Γ α λ ; β λ ) ( g ) 1 / 2 d 4 x ( 1 / 16 π ) g α β δ Γ α β ; λ λ δ Γ α λ ; β λ ( g ) 1 / 2 d 4 x (1//16 pi)intg^(alpha beta)(deltaGamma_(alpha beta;lambda)^(lambda)-deltaGamma_(alpha lambda;beta)^(lambda))(-g)^(1//2)d^(4)x(1 / 16 \pi) \int g^{\alpha \beta}\left(\delta \Gamma_{\alpha \beta ; \lambda}^{\lambda}-\delta \Gamma_{\alpha \lambda ; \beta}^{\lambda}\right)(-g)^{1 / 2} d^{4} x(1/16π)gαβ(δΓαβ;λλδΓαλ;βλ)(g)1/2d4x
and integrate by parts to eliminate the derivatives of the δ Γ δ Γ delta Gamma\delta \GammaδΓ. To prepare the way for this integration, introduce the concept of tensor density, a notational device widely applied in general relativity. The concept of tensor density aims at economy. Without this concept, one will treat the tensor
ϵ μ α β γ = ( g ) 1 / 2 [ μ α β γ ] ϵ μ α β γ = ( g ) 1 / 2 [ μ α β γ ] epsilon_(mu alpha beta gamma)=(-g)^(1//2)[mu alpha beta gamma]\epsilon_{\mu \alpha \beta \gamma}=(-g)^{1 / 2}[\mu \alpha \beta \gamma]ϵμαβγ=(g)1/2[μαβγ]
(see exercise 3.13) as having 4 4 = 256 4 4 = 256 4^(4)=2564^{4}=25644=256 components, and its covariant derivative as having 4 5 = 1 , 024 4 5 = 1 , 024 4^(5)=1,0244^{5}=1,02445=1,024 components, of which one is
ϵ 0123 ; ρ = ( g ) 1 / 2 / x ρ ϵ [ 0123 ] Γ 0 ρ σ ϵ σ 123 Γ 1 ρ σ ϵ 0 σ 23 Γ 2 ρ σ ϵ 0103 Γ 3 ρ σ ϵ 012 σ = [ ( g ) 1 / 2 , ρ Γ σ ρ σ ( g ) 1 / 2 ] [ 0123 ] . ϵ 0123 ; ρ = ( g ) 1 / 2 / x ρ ϵ [ 0123 ] Γ 0 ρ σ ϵ σ 123 Γ 1 ρ σ ϵ 0 σ 23 Γ 2 ρ σ ϵ 0103 Γ 3 ρ σ ϵ 012 σ = ( g ) 1 / 2 , ρ Γ σ ρ σ ( g ) 1 / 2 [ 0123 ] . {:[epsilon_(0123;rho)=del(-g)^(1//2)//delx^(rho)epsilon_([0123])-Gamma_(0rho)^(sigma)epsilon_(sigma123)-Gamma_(1rho)^(sigma)epsilon_(0sigma23)],[-Gamma_(2rho)^(sigma)epsilon_(0103)-Gamma_(3rho)^(sigma)epsilon_(012 sigma)],[=[(-g)^(1//2)_(,rho)-Gamma_(sigma rho)^(sigma)(-g)^(1//2)][0123].]:}\begin{aligned} \epsilon_{0123 ; \rho} & =\partial(-g)^{1 / 2} / \partial x^{\rho} \epsilon_{[0123]}-\Gamma_{0 \rho}^{\sigma} \epsilon_{\sigma 123}-\Gamma_{1 \rho}^{\sigma} \epsilon_{0 \sigma 23} \\ & -\Gamma_{2 \rho}^{\sigma} \epsilon_{0103}-\Gamma_{3 \rho}^{\sigma} \epsilon_{012 \sigma} \\ & =\left[(-g)^{1 / 2}{ }_{, \rho}-\Gamma_{\sigma \rho}^{\sigma}(-g)^{1 / 2}\right][0123] . \end{aligned}ϵ0123;ρ=(g)1/2/xρϵ[0123]Γ0ρσϵσ123Γ1ρσϵ0σ23Γ2ρσϵ0103Γ3ρσϵ012σ=[(g)1/2,ρΓσρσ(g)1/2][0123].
The symbol [ α β γ δ ] [ α β γ δ ] [alpha beta gamma delta][\alpha \beta \gamma \delta][αβγδ], with values ( 0 , 1 , + 1 ) ( 0 , 1 , + 1 ) (0,-1,+1)(0,-1,+1)(0,1,+1), introduces what is largely excess baggage, doing mere bookkeeping on alternating indices. Drop this unhandiness. Introduce instead the non-tensor ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2 and define for it the law of covariant differentiation,
(21.24) ( g ) 1 / 2 = ( g ) 1 / 2 , ρ Γ σ ρ σ ( g ) 1 / 2 . (21.24) ( g ) 1 / 2 = ( g ) 1 / 2 , ρ Γ σ ρ σ ( g ) 1 / 2 . {:(21.24)(-g)^(1//2)=(-g)^(1//2)_(,rho)-Gamma_(sigma_(rho))^(sigma)(-g)^(1//2).:}\begin{equation*} (-g)^{1 / 2}=(-g)^{1 / 2}{ }_{, \rho}-\Gamma_{\sigma_{\rho}}^{\sigma}(-g)^{1 / 2} . \tag{21.24} \end{equation*}(21.24)(g)1/2=(g)1/2,ρΓσρσ(g)1/2.
These four components take the place of the 1,024 components and communicate all the important information that was in them.
Associated with the vector j μ j μ j_(mu)j_{\mu}jμ is the vector density
j μ = ( g ) 1 / 2 j μ j μ = ( g ) 1 / 2 j μ j_(mu)=(-g)^(1//2)j_(mu)j_{\mu}=(-g)^{1 / 2} j_{\mu}jμ=(g)1/2jμ
with the tensor T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν, the tensor density
C μ ν = ( g ) 1 / 2 T μ ν ; C μ ν = ( g ) 1 / 2 T μ ν ; C_(mu nu)=(-g)^(1//2)T_(mu nu);\mathfrak{C}_{\mu \nu}=(-g)^{1 / 2} T_{\mu \nu} ;Cμν=(g)1/2Tμν;
and so on; the German gothic letter is a standard indicator for the presence of the factor ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2. On some occasions (see, for example, §21.11) it is convenient to multiply the components of a tensor with a power of ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2 other than 1 . According to the value of the exponent, the resulting assemblage of components is then called a tensor density of this or that weight.
The law of differentiation of an ordinary or standard tensor density formed from a tensor of arbitrary order,
A = ( g ) 1 / 2 A , A = ( g ) 1 / 2 A , A_(cdots)*=(-g)^(1//2)A_(cdots)^(cdots),\mathfrak{A}_{\cdots} \cdot=(-g)^{1 / 2} A_{\cdots}^{\cdots},A=(g)1/2A,
is
(๙...) ) ; ρ = ( 凡... ) , ρ + ( standard Γ . . . terms of a standard covariant derivative multiplied into ๙...) - (ঞ..) Γ σ ρ σ .  (๙...)  ) ; ρ = (  凡...  ) , ρ +  standard  Γ . . . terms of a standard covariant   derivative multiplied into ๙...) - (ঞ..)  Γ σ ρ σ {:[" (๙...) ")_(;rho)=(" 凡... ")_(,rho)+(" standard "Gamma_(...):}"terms of a standard covariant "],[" derivative multiplied into ๙...) - (ঞ..) "Gamma_(sigma rho)^(sigma)". "]:}\begin{aligned} & \text { (๙...) })_{; \rho}=(\text { 凡... })_{, \rho}+\left(\text { standard } \Gamma_{. . .}\right. \text {terms of a standard covariant } \\ & \text { derivative multiplied into ๙...) - (ঞ..) } \Gamma_{\sigma \rho}^{\sigma} \text {. } \end{aligned} (๙...) );ρ=( 凡... ),ρ+( standard Γ...terms of a standard covariant  derivative multiplied into ๙...) - (ঞ..) Γσρσ
The covariant derivative of a product is the sum of two terms: the covariant deriva-
tive of the first, times the second, plus the first times the covariant derivative of the second.
Now return to the integral to be evaluated. Combine the factors g α β g α β g^(alpha beta)g^{\alpha \beta}gαβ and ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2 into the tensor density g α β g α β g^(alpha beta)\mathfrak{g}^{\alpha \beta}gαβ. Integrate covariantly by parts, as justified by the rule for the covariant derivative of a product. Get a "term at limits," plus the integral
( 1 / 16 π ) ( g α β ; λ δ λ β g α γ ; γ ) δ Γ α β λ d 4 x . ( 1 / 16 π ) g α β ; λ δ λ β g α γ ; γ δ Γ α β λ d 4 x . -(1//16 pi)int(g^(alpha beta)_(;lambda)-delta_(lambda)^(beta)g^(alpha gamma)_(;gamma))deltaGamma_(alpha beta)^(lambda)d^(4)x.-(1 / 16 \pi) \int\left(\mathfrak{g}^{\alpha \beta}{ }_{; \lambda}-\delta_{\lambda}^{\beta} \mathfrak{g}^{\alpha \gamma}{ }_{; \gamma}\right) \delta \Gamma_{\alpha \beta}^{\lambda} d^{4} x .(1/16π)(gαβ;λδλβgαγ;γ)δΓαβλd4x.
This integral is the only term in the action integral that contains the variations of the Γ Γ Gamma\GammaΓ 's at the "interior points" of interest here. For the integral to be an extremum, the symmetrized coefficient of δ Γ α β λ δ Γ α β λ deltaGamma_(alpha beta)^(lambda)\delta \Gamma_{\alpha \beta}^{\lambda}δΓαβλ must vanish,
g α β ; λ 1 2 δ λ α g β γ ; γ 1 2 δ λ β g α γ ; γ = 0 . g α β ; λ 1 2 δ λ α g β γ ; γ 1 2 δ λ β g α γ ; γ = 0 . g^(alpha beta)_(;lambda)-(1)/(2)delta_(lambda)^(alpha)g^(beta gamma)_(;gamma)-(1)/(2)delta_(lambda)^(beta)g^(alpha gamma)_(;gamma)=0.\mathfrak{g}^{\alpha \beta}{ }_{; \lambda}-\frac{1}{2} \delta_{\lambda}^{\alpha} \mathfrak{g}^{\beta \gamma}{ }_{; \gamma}-\frac{1}{2} \delta_{\lambda}^{\beta} \mathfrak{g}^{\alpha \gamma}{ }_{; \gamma}=0 .gαβ;λ12δλαgβγ;γ12δλβgαγ;γ=0.
This set of forty equations for the forty covariant derivative g α β ; λ g α β ; λ g^(alpha beta)_(;lambda)\mathfrak{g}^{\alpha \beta}{ }_{; \lambda}gαβ;λ has only the zero solution,
(21.25) g α β ; λ = 0 . (21.25) g α β ; λ = 0 . {:(21.25)g^(alpha beta)_(;lambda)=0.:}\begin{equation*} \mathfrak{g}^{\alpha \beta}{ }_{; \lambda}=0 . \tag{21.25} \end{equation*}(21.25)gαβ;λ=0.
Thus the "density formed from the reciprocal metric tensor" is covariantly constant.
This simple result (1) brings many simple results in its train: the covariant constancy of ( 2 ) ( g ) 1 / 2 ( 2 ) ( g ) 1 / 2 (2)(-g)^(1//2)(2)(-g)^{1 / 2}(2)(g)1/2, (3) g α β g α β g^(alpha beta)g^{\alpha \beta}gαβ, (4) g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ, and (5) g α β g α β g_(alpha beta)\mathfrak{g}_{\alpha \beta}gαβ. Of these, (4) is of special interest here, and (2) is needed in proving it, as follows. Take definition (21.24) for the covariant derivative of ( g ) 1 / 2 ( g ) 1 / 2 (-g)^(1//2)(-g)^{1 / 2}(g)1/2, and calculate the ordinary derivative that appears in the first term from exercise 21.1. One encounters in this calculation terms of the form g α β / x λ g α β / x λ delg^(alpha beta)//delx^(lambda)\partial \mathfrak{g}^{\alpha \beta} / \partial x^{\lambda}gαβ/xλ. Use (21.25) to evaluate them, and end up with the result
( g ) 1 / 2 ; λ = 0 ( g ) 1 / 2 ; λ = 0 (-g)^(1//2)_(;lambda)=0(-g)^{1 / 2}{ }_{; \lambda}=0(g)1/2;λ=0
From this result it follows that the covariant derivative of the ( 1 1 ) ( 1 1 ) ((1)/(1))\binom{1}{1}(11)-tensor density ( g ) 1 / 2 δ γ α ( g ) 1 / 2 δ γ α (-g)^(1//2)delta_(gamma)^(alpha)(-g)^{1 / 2} \delta_{\gamma}^{\alpha}(g)1/2δγα is also zero. But this tensor density is the product of the tensor density g α β g α β g^(alpha beta)\mathrm{g}^{\alpha \beta}gαβ by the ordinary metric tensor g β γ g β γ g_(beta gamma)g_{\beta \gamma}gβγ. In the covariant derivative of this product by x λ x λ x^(lambda)x^{\lambda}xλ, one already knows that the derivative of the first factor is zero. Therefore the first factor times the derivative of the second must be zero,
g α β g β γ ; λ = 0 , g α β g β γ ; λ = 0 , g^(alpha beta)g_(beta gamma;lambda)=0,\mathfrak{g}^{\alpha \beta} g_{\beta \gamma ; \lambda}=0,gαβgβγ;λ=0,
and from this it follows that
(21.26) g β γ ; λ = 0 , (21.26) g β γ ; λ = 0 , {:(21.26)g_(beta gamma;lambda)=0",":}\begin{equation*} g_{\beta \gamma ; \lambda}=0, \tag{21.26} \end{equation*}(21.26)gβγ;λ=0,
as was to be proven; or, explicitly,
g β γ x λ g γ σ Γ β λ σ g β σ Γ γ λ σ = 0 . g β γ x λ g γ σ Γ β λ σ g β σ Γ γ λ σ = 0 . (delg_(beta gamma))/(delx^(lambda))-g_(gamma sigma)Gamma_(beta lambda)^(sigma)-g_(beta sigma)Gamma_(gamma lambda)^(sigma)=0.\frac{\partial g_{\beta \gamma}}{\partial x^{\lambda}}-g_{\gamma \sigma} \Gamma_{\beta \lambda}^{\sigma}-g_{\beta \sigma} \Gamma_{\gamma \lambda}^{\sigma}=0 .gβγxλgγσΓβλσgβσΓγλσ=0.
Solve these equations for the Γ Γ Gamma\GammaΓ 's, which up to now have been independent of the g β γ g β γ g_(beta gamma)g_{\beta \gamma}gβγ, and end up with the standard equation for the connection coefficients,
(21.27) Γ μ ν ρ = 1 2 g ρ σ ( g μ σ , ν + g σ ν , μ g μ ν , σ ) , (21.27) Γ μ ν ρ = 1 2 g ρ σ g μ σ , ν + g σ ν , μ g μ ν , σ , {:(21.27)Gamma_(mu nu)^(rho)=(1)/(2)g^(rho sigma)(g_(mu sigma,nu)+g_(sigma nu,mu)-g_(mu nu,sigma))",":}\begin{equation*} \Gamma_{\mu \nu}^{\rho}=\frac{1}{2} g^{\rho \sigma}\left(g_{\mu \sigma, \nu}+g_{\sigma \nu, \mu}-g_{\mu \nu, \sigma}\right), \tag{21.27} \end{equation*}(21.27)Γμνρ=12gρσ(gμσ,ν+gσν,μgμν,σ),
as required for Riemannian geometry.
Similarly, equate to zero the coefficient of δ g α β δ g α β deltag^(alpha beta)\delta g^{\alpha \beta}δgαβ in the variation (21.23), and find all ten components of Einstein's field equation, in the form
(21.28) G α β = 8 π ( g α β L field 2 δ L field δ g α β ) { identified in § 21.3 with the stress-energy tensor T α β ] . (21.28) G α β = 8 π g α β L field  2 δ L field  δ g α β  identified in  § 21.3  with   the stress-energy tensor  T α β . {:(21.28)G_(alpha beta)=8piubrace((g_(alpha beta)L_("field ")-2(deltaL_("field "))/(deltag^(alpha beta)))ubrace)_({[" identified in "§21.3" with "],[" the stress-energy tensor "T_(alpha beta)]]).:}G_{\alpha \beta}=8 \pi \underbrace{\left(g_{\alpha \beta} L_{\text {field }}-2 \frac{\delta L_{\text {field }}}{\delta g^{\alpha \beta}}\right)}_{\left\{\begin{array}{l} \text { identified in } \S 21.3 \text { with } \tag{21.28}\\ \text { the stress-energy tensor } T_{\alpha \beta} \end{array}\right]} .(21.28)Gαβ=8π(gαβLfield 2δLfield δgαβ){ identified in §21.3 with  the stress-energy tensor Tαβ].
Among variations of the metric, one of the simplest is the change
(21.29) g new μ ν = g μ ν + δ g μ ν = g μ ν + ξ μ ; ν + ξ ν ; μ (21.29) g new  μ ν = g μ ν + δ g μ ν = g μ ν + ξ μ ; ν + ξ ν ; μ {:(21.29)g_("new "mu nu)=g_(mu nu)+deltag_(mu nu)=g_(mu nu)+xi_(mu;nu)+xi_(nu;mu):}\begin{equation*} g_{\text {new } \mu \nu}=g_{\mu \nu}+\delta g_{\mu \nu}=g_{\mu \nu}+\xi_{\mu ; \nu}+\xi_{\nu ; \mu} \tag{21.29} \end{equation*}(21.29)gnew μν=gμν+δgμν=gμν+ξμ;ν+ξν;μ
brought about by the infinitesimal coordinate transformation
(21.30) x new μ = x μ ξ μ (21.30) x new μ = x μ ξ μ {:(21.30)x_(new)^(mu)=x^(mu)-xi^(mu):}\begin{equation*} x_{\mathrm{new}}^{\mu}=x^{\mu}-\xi^{\mu} \tag{21.30} \end{equation*}(21.30)xnewμ=xμξμ
Although the metric changes, the 3-geometry does not. It does not matter whether the spacetime geometry that one is dealing with extremizes the action principle or not, whether it is a solution of Einstein's equations or not; the action integral I I III is a scalar invariant, a number, the value of which depends on the physics but not at all on the system of coordinates in which that physics is expressed. This invariance even obtains for both parts of the action principle individually ( I geom I geom  I_("geom ")I_{\text {geom }}Igeom  and I fields I fields  I_("fields ")I_{\text {fields }}Ifields  ). Therefore neither part will be affected in value by the variation (21.29). In other words, the quantity
(21.31) δ I geom = ( 1 / 16 π ) G α β ( ξ α ; β + ξ β ; α ) ( g ) 1 / 2 d 4 x ¯ ( 1 / 8 π ) G α β ; β ξ α ( g ) 1 / 2 d 4 x [ "covariant integration by parts"] (21.31) δ I geom  = ( 1 / 16 π ) G α β ξ α ; β + ξ β ; α ( g ) 1 / 2 d 4 x ¯ ¯ ( 1 / 8 π ) G α β ; β ξ α ( g ) 1 / 2 d 4 x [  "covariant integration by parts"]  {:[(21.31)deltaI_("geom ")=(1//16 pi)intG_(alpha beta)(xi^(alpha;beta)+xi^(beta;alpha))(-g)^(1//2)d^(4)x],[ bar(bar(uarr))-(1//8pi)intG_(alpha beta)^(;betaxi^(alpha))(-g)^(1//2)d^(4)x],[quad[" "covariant integration by parts"] "]:}\begin{align*} \delta I_{\text {geom }} & =(1 / 16 \pi) \int G_{\alpha \beta}\left(\xi^{\alpha ; \beta}+\xi^{\beta ; \alpha}\right)(-g)^{1 / 2} d^{4} x \tag{21.31}\\ & \overline{\bar{\uparrow}}-(1 / 8 \pi) \int G_{\alpha \beta}^{; \beta \xi^{\alpha}}(-g)^{1 / 2} d^{4} x \\ & \quad[\text { "covariant integration by parts"] } \end{align*}(21.31)δIgeom =(1/16π)Gαβ(ξα;β+ξβ;α)(g)1/2d4x¯(1/8π)Gαβ;βξα(g)1/2d4x[ "covariant integration by parts"] 
must vanish whatever the 4 -geometry and whatever the change ξ α ξ α xi^(alpha)\xi^{\alpha}ξα. In this way, one sees from a new angle the contracted Bianchi identities of Chapter 15,
(21.32) G α β ; β = 0 (21.32) G α β ; β = 0 {:(21.32)G_(alpha beta)^(;beta)=0:}\begin{equation*} G_{\alpha \beta}^{; \beta}=0 \tag{21.32} \end{equation*}(21.32)Gαβ;β=0
The "neutrality" of the action principle with respect to a mere coordinate transformation such as ( 21.29 ) ( 21.29 ) (21.29)(21.29)(21.29) shows once again that the variational principle-and with it Einstein's equation-cannot determine the coordinates or the metric, but only the 4-geometry itself.

Exercise 21.1. VARIATION OF THE DETERMINANT OF THE METRIC TENSOR

EXERCISE

Recalling that the change in the value of any determinant is given by multiplying the change in each element of that determinant by its cofactor and adding the resulting products (exercise 5.5) prove that
δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν and δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν  and  δ ( g ) 1 / 2 = 1 2 ( g ) 1 / 2 g μ ν δ g μ ν delta(-g)^(1//2)=(1)/(2)(-g)^(1//2)g^(mu nu)deltag_(mu nu)quad" and "quad delta(-g)^(1//2)=-(1)/(2)(-g)^(1//2)g_(mu nu)deltag^(mu nu)\delta(-g)^{1 / 2}=\frac{1}{2}(-g)^{1 / 2} g^{\mu \nu} \delta g_{\mu \nu} \quad \text { and } \quad \delta(-g)^{1 / 2}=-\frac{1}{2}(-g)^{1 / 2} g_{\mu \nu} \delta g^{\mu \nu}δ(g)1/2=12(g)1/2gμνδgμν and δ(g)1/2=12(g)1/2gμνδgμν
Also show that
g = det g μ ν and δ ( g ) 1 / 2 = + 1 2 g μ ν δ g μ ν g = det g μ ν  and  δ ( g ) 1 / 2 = + 1 2 g μ ν δ g μ ν g=det||g^(mu nu)||quad" and "quad delta(-g)^(1//2)=+(1)/(2)g_(mu nu)deltag^(mu nu)g=\operatorname{det}\left\|\mathfrak{g}^{\mu \nu}\right\| \quad \text { and } \quad \delta(-g)^{1 / 2}=+\frac{1}{2} g_{\mu \nu} \delta \mathfrak{g}^{\mu \nu}g=detgμν and δ(g)1/2=+12gμνδgμν
Action unaffected by mere change in coordinatization
Lagrangian generates stress-energy tensor
Electromagnetism as an example
Contrast to stress-energy tensor of "canonical field theory"

§21.3. MATTER LAGRANGIAN AND STRESS-ENERGY TENSOR

The derivation of Einstein's geometrodynamic law from Hilbert's action principle puts on the righthand side a source term that is derived from the field Lagrangian. In contrast, the derivation of Chapter 17 identified the source term with the stressenergy tensor of the field. For the two derivations to be compatible, the stress-energy tensor must be given by the expression
(21.33a) T α β = 2 δ L field δ g α β + g α β L field , (21.33a) T α β = 2 δ L field  δ g α β + g α β L field  , {:(21.33a)T_(alpha beta)=-2(deltaL_("field "))/(deltag^(alpha beta))+g_(alpha beta)L_("field ")",":}\begin{equation*} T_{\alpha \beta}=-2 \frac{\delta L_{\text {field }}}{\delta g^{\alpha \beta}}+g_{\alpha \beta} L_{\text {field }}, \tag{21.33a} \end{equation*}(21.33a)Tαβ=2δLfield δgαβ+gαβLfield ,
or
(21.33b) ( g ) 1 / 2 T α β V α β = 2 δ L field δ g α β (21.33b) ( g ) 1 / 2 T α β V α β = 2 δ L field  δ g α β {:(21.33b)(-g)^(1//2)T^(alpha beta)-=V^(alpha beta)=2(deltaL_("field "))/(deltag_(alpha beta)):}\begin{equation*} (-g)^{1 / 2} T^{\alpha \beta} \equiv \mathbb{V}^{\alpha \beta}=2 \frac{\delta \mathcal{L}_{\text {field }}}{\delta g_{\alpha \beta}} \tag{21.33b} \end{equation*}(21.33b)(g)1/2TαβVαβ=2δLfield δgαβ
What are the consequences of this identification?
By the term "Lagrange function of the field" as employed here, one means the Lagrange function of the classical theory as formulated in flat spacetime, with the flat-spacetime metric replaced wherever it appears by the actual metric, and with the "comma-goes-to-semicolon rule" of Chapter 16 applied to all derivatives.
Were one dealing with a general tensorial field, the comma-goes-to-semicolon rule would introduce, in addition to the derivative of the tensorial field with all its indices, a number of Γ Γ Gamma^(')\Gamma^{\prime}Γ 's equal to the number of indices. The presence of these Γ Γ Gamma^(')\Gamma^{\prime}Γ 's in the field Lagrangian would have unhappy consequences for the Palatini variational procedure described in $ 21.2 $ 21.2 $21.2\$ 21.2$21.2. No longer would the Γ Γ Gamma\GammaΓ 's end up given in terms of the metric coefficients by the standard formula (21.27). No longer would the geometry, as derived from the Hilbert-Palatini variation principle, be Riemannian. Then what?
These troublesome issues do not arise in two well-known simple cases, a scalar field and an electromagnetic field. In the one case, the field Lagrangian becomes
(21.34) L field = ( 1 / 8 π ) [ g α β ( ϕ / x α ) ( ϕ / x β ) m 2 ϕ 2 ] . (21.34) L field  = ( 1 / 8 π ) g α β ϕ / x α ϕ / x β m 2 ϕ 2 . {:(21.34)L_("field ")=(1//8pi)[-g^(alpha beta)(del phi//delx^(alpha))(del phi//delx^(beta))-m^(2)phi^(2)].:}\begin{equation*} L_{\text {field }}=(1 / 8 \pi)\left[-g^{\alpha \beta}\left(\partial \phi / \partial x^{\alpha}\right)\left(\partial \phi / \partial x^{\beta}\right)-m^{2} \phi^{2}\right] . \tag{21.34} \end{equation*}(21.34)Lfield =(1/8π)[gαβ(ϕ/xα)(ϕ/xβ)m2ϕ2].
No connection coefficient comes in; the quantity being differentiated is a scalar. In the other case, the field Lagrangian is built on first derivatives of the 4 -potential A μ A μ A_(mu)A_{\mu}Aμ. Therefore Γ Γ Gamma\GammaΓ 's should appear, according to the standard rules for covariant differentiation (Box 8.4). However, the derivatives of the A A AAA 's appear, never alone, but always in an antisymmetric combination where the Γ Γ Gamma\GammaΓ 's cancel, making covariant derivatives equivalent to ordinary derivatives:
(21.35) F μ ν = A ν ; μ A μ ; v = A v , μ A μ , v . (21.35) F μ ν = A ν ; μ A μ ; v = A v , μ A μ , v . {:(21.35)F_(mu nu)=A_(nu;mu)-A_(mu;v)=A_(v,mu)-A_(mu,v).:}\begin{equation*} F_{\mu \nu}=A_{\nu ; \mu}-A_{\mu ; v}=A_{v, \mu}-A_{\mu, v} . \tag{21.35} \end{equation*}(21.35)Fμν=Aν;μAμ;v=Av,μAμ,v.
In both cases, the differentiations of (21.33) to generate the stress-energy tensor are easily carried out (exercises 21.2 and 21.3) and give the standard expressions already seen [(5.22) and (5.23)] for T μ ν T μ ν T_(mu nu)T_{\mu \nu}Tμν in one of these two cases in an earlier chapter.
Field theory provides a quite other method to generate a so-called canonical expression for the stress-energy tensor of a field [see, for example, Wentzel (1949)].
By the very manner of construction, such an expression is guaranteed also to satisfy the law of conservation of momentum and energy, and by this circumstance it too becomes useful in certain contexts. However, the canonical tensor is often not symmetric in its two indices, and in such cases violates the law of conservation of angular momentum (see discussion in §5.7). Even when symmetric, it may give a quite different localization of stress and energy than that given by (21.33). Field theory in and by itself is unable to decide between these different pictures of where the field energy is localized. However, direct measurements of the pull of gravitation provide in principle [see, for example, Feynman (1964)] a means to distinguish between alternative prescriptions for the localization of stress-energy, because gravitation responds directly to density of mass-energy and momentum. It is therefore a happy circumstance that the theory of gravity in the variational formulation gives a unique prescription for fixing the stress-energy tensor, a prescription that, besides being symmetric, also automatically satisfies the laws of conservation of momentum and energy (exercises 21.2 and 21.3). [For an early discussion of the symmetrization of the stress-energy tensor, see Rosenfeld (1940) and Belinfante (1940). A more extensive discussion is given by Corson (1953) and Davis (1970), along with extensive references to the literature.]
When one deals with a spinor field, one finds it convenient to take as the quantities to be varied, not the metric coefficients themselves, but the components of a tetrad of orthonormal vectors defined as a tetrad field over all space [see Davis (1970) for discussion and references].

Exercise 21.2. STRESS-ENERGY TENSOR FOR A SCALAR FIELD

EXERCISES

Given the Lagrange function (21.34) of a scalar field, derive the stress-energy tensor for this field. Also write down the field equation for the scalar field that one derives from this Lagrange function (in the general case where the field executes its dynamics within the arena of a curved spacetime). Show that as a consequence of this field equation, the stress-energy tensor satisfies the conservation law, T α β ; β = 0 T α β ; β = 0 T_(alpha beta);beta=0T_{\alpha \beta} ; \beta=0Tαβ;β=0.

Exercise 21.3. FARADAY-MAXWELL STRESS-ENERGY TENSOR

Given the Lagrangian density F μ ν F μ ν / 16 π F μ ν F μ ν / 16 π -F_(mu nu)F^(mu nu)//16 pi-F_{\mu \nu} F^{\mu \nu} / 16 \piFμνFμν/16π, reexpress it in terms of the variables A μ A μ A_(mu)A_{\mu}Aμ and g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν, and by use of (21.33) derive the stress-energy tensor as discussed in §5.6. Also derive from the Lagrange variation principle the field equation F α β ; β = 0 F α β ; β = 0 F_(alpha beta);beta=0F_{\alpha \beta} ; \beta=0Fαβ;β=0 (curved spacetime, but-for simplicity-a charge-free region of space). As a consequence of this field equation, show that the Faraday-Maxwell stress-energy tensor satisfies the conservation law, T α β ; β = 0 T α β ; β = 0 T_(alpha beta)^(;beta)=0T_{\alpha \beta}{ }^{; \beta}=0Tαβ;β=0. For a more ambitious project, show that any stress-energy tensor derived from a field Lagrangian by the prescription of equation (21.33) will automatically satisfy the conservation law T α β ; β = 0 T α β ; β = 0 T_(alpha beta)^(;beta)=0T_{\alpha \beta}^{; \beta}=0Tαβ;β=0.

§21.4. SPLITTING SPACETIME INTO SPACE AND TIME

There are many ways to "push forward" many-fingered time and explore spacetime faster here and slower there, or faster there and slower here. However, a computer is most efficiently programmed only when it follows one definite prescription. The
Figure 21.2.
Building two 3 -geometries into a thin sandwich 4 -geometry, by interposing perpendicular connectors between the two, with preassigned lengths and shifts. What would otherwise be flexible thereupon becomes rigid. The flagged point illustrates equation (21.40).
Slice spacetime to compute spacetime
Thin sandwich 4-geometry
successive hypersurfaces on which it gives the geometry are most conveniently described by successive values of a time-parameter t t ttt. One treats on a different footing the 3-geometries of these hypersurfaces and the 4-geometry that fills in between these laminations.
The slicing of spacetime into a one-parameter family of spacelike hypersurfaces is called for, not only by the analysis of the dynamics along the way, but also by the boundary conditions as they pose themselves in any action principle of the form, "Give the 3 -geometries on the two faces of a sandwich of spacetime, and adjust the 4 -geometry in between to extremize the action."
There is no simpler sandwich to consider than one of infinitesimal thickness (Figure 21.2). Choosing coordinates adapted to the ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1)-space-time split, designate the "lower" (earlier) hypersurface in the diagram as t = t = t=t=t= constant and the "upper" (later) one as t + d t = t + d t = t+dt=t+d t=t+dt= constant (names, only names; no direct measure whatsoever of proper time). Compare the two hypersurfaces with two ribbons of steel out of which one wants to construct a rigid structure. To give the geometry on the two ribbons by no means fixes this structure; for that purpose, one needs cross-connectors between the one ribbon and the other. It is not even enough (1) to specify that these connectors are to be welded on perpendicular to the lower ribbon; (2) to specify where each is to be welded; and (3) to give its length. One must in addition tell where each connector joins the upper surface. If the proper distances between tops of the connectors are everywhere shorter than the distances between the bases of the connectors, the double ribbon will have the curve of the cable of a suspension bridge; if everywhere longer, the curve of the arch of a masonry bridge. The data necessary for the construction of the sandwich are thus (1) the metric of the 3-geometry of the lower hypersurface,
(21.36) g i j ( t , x , y , z ) d x i d x j (21.36) g i j ( t , x , y , z ) d x i d x j {:(21.36)g_(ij)(t","x","y","z)dx^(i)dx^(j):}\begin{equation*} g_{i j}(t, x, y, z) d x^{i} d x^{j} \tag{21.36} \end{equation*}(21.36)gij(t,x,y,z)dxidxj
telling the (distance) 2 2 ^(2){ }^{2}2 between one point in that hypersurface and another; (2) the metric on the upper hypersurface,
(21.37) g i j ( t + d t , x , y , z ) d x i d x j (21.37) g i j ( t + d t , x , y , z ) d x i d x j {:(21.37)g_(ij)(t+dt","x","y","z)dx^(i)dx^(j):}\begin{equation*} g_{i j}(t+d t, x, y, z) d x^{i} d x^{j} \tag{21.37} \end{equation*}(21.37)gij(t+dt,x,y,z)dxidxj
(3) a formula for the proper length,
(21.38) ( lapse of proper time between lower and upper hypersurface ) = ( "lapse function" ) d t = N ( t , x , y , z ) d t (21.38)  lapse of   proper time   between lower   and upper   hypersurface  = (  "lapse   function"  ) d t = N ( t , x , y , z ) d t {:(21.38)([" lapse of "],[" proper time "],[" between lower "],[" and upper "],[" hypersurface "])=((" "lapse ")/(" function" "))dt=N(t","x","y","z)dt:}\left(\begin{array}{l} \text { lapse of } \tag{21.38}\\ \text { proper time } \\ \text { between lower } \\ \text { and upper } \\ \text { hypersurface } \end{array}\right)=\binom{\text { "lapse }}{\text { function" }} d t=N(t, x, y, z) d t(21.38)( lapse of  proper time  between lower  and upper  hypersurface )=( "lapse  function" )dt=N(t,x,y,z)dt
of the connector that is based on the point ( x , y , z ) ( x , y , z ) (x,y,z)(x, y, z)(x,y,z) of the lower hypersurface; and (4) a formula for the place on the upper hypersurface,
(21.39) x upper i ( x m ) = x i N i ( t , x , y , z ) d t (21.39) x upper i x m = x i N i ( t , x , y , z ) d t {:(21.39)x_(upper)^(i)(x^(m))=x^(i)-N^(i)(t","x","y","z)dt:}\begin{equation*} x_{\mathrm{upper}}^{i}\left(x^{m}\right)=x^{i}-N^{i}(t, x, y, z) d t \tag{21.39} \end{equation*}(21.39)xupperi(xm)=xiNi(t,x,y,z)dt
where this connector is to be welded. Omit part of this information, and find the structure deprived of rigidity.
The rigidity of the structure of the thin sandwich is most immediately revealed in the definiteness of the 4 -geometry of the spacetime filling of the sandwich. Ask for the proper interval d s d s dsd sds or d τ d τ d taud \taudτ between x α = ( t , x i ) x α = t , x i x^(alpha)=(t,x^(i))x^{\alpha}=\left(t, x^{i}\right)xα=(t,xi) and x α + d x α = x α + d x α = x^(alpha)+dx^(alpha)=x^{\alpha}+d x^{\alpha}=xα+dxα= ( t + d t , x i + d x i ) t + d t , x i + d x i (t+dt,x^(i)+dx^(i))\left(t+d t, x^{i}+d x^{i}\right)(t+dt,xi+dxi). The Pythagorean theorem in its 4 -dimensional form
d s 2 = ( proper distance in base 3-geometry ) 2 ( proper time from lower to upper 3-geometry ) 2 d s 2 = (  proper distance   in base 3-geometry  ) 2 (  proper time from   lower to upper 3-geometry  ) 2 ds^(2)=((" proper distance ")/(" in base 3-geometry "))^(2)-((" proper time from ")/(" lower to upper 3-geometry "))^(2)d s^{2}=\binom{\text { proper distance }}{\text { in base 3-geometry }}^{2}-\binom{\text { proper time from }}{\text { lower to upper 3-geometry }}^{2}ds2=( proper distance  in base 3-geometry )2( proper time from  lower to upper 3-geometry )2
yields the result (see Figure 21.2).
(21.40) d s 2 = g i j ( d x i + N i d t ) ( d x j + N j d t ) ( N d t ) 2 (21.40) d s 2 = g i j d x i + N i d t d x j + N j d t ( N d t ) 2 {:(21.40)ds^(2)=g_(ij)(dx^(i)+N^(i)dt)(dx^(j)+N^(j)dt)-(Ndt)^(2):}\begin{equation*} d s^{2}=g_{i j}\left(d x^{i}+N^{i} d t\right)\left(d x^{j}+N^{j} d t\right)-(N d t)^{2} \tag{21.40} \end{equation*}(21.40)ds2=gij(dxi+Nidt)(dxj+Njdt)(Ndt)2
Here as in (21.36) the g i j g i j g_(ij)g_{i j}gij are the metric coefficients of the 3-geometry, distinguished by their Latin labels from the Greek-indexed components of the 4 -metric,
(21.41) d s 2 = ( 4 ) g α β d x α d x β (21.41) d s 2 = ( 4 ) g α β d x α d x β {:(21.41)ds^(2)=^((4))g_(alpha beta)dx^(alpha)dx^(beta):}\begin{equation*} d s^{2}={ }^{(4)} g_{\alpha \beta} d x^{\alpha} d x^{\beta} \tag{21.41} \end{equation*}(21.41)ds2=(4)gαβdxαdxβ
labeled here with a suffix ( 4 ) ( 4 ) ^((4)){ }^{(4)}(4) to reduce the possibility of confusion. Comparing (21.41) and (21.40), one arrives at the following construction of the 4 -metric out of the 3-metric and the lapse and shift functions [Arnowitt, Deser, and Misner (1962)]:
(21.42) ( 4 ) g 00 ( 4 ) g 0 k ( 4 ) g i 0 ( 4 ) g i k = ( N s N s N 2 ) N k N i g i k . (21.42) ( 4 ) g 00 ( 4 ) g 0 k ( 4 ) g i 0 ( 4 ) g i k = N s N s N 2 N k N i g i k . {:(21.42)||[(4)],[g_(00),^((4))g_(0k)],[(4)g_(i0),^((4))g_(ik)]||=||[(N_(s)N^(s)-N^(2)),N_(k)],[N_(i),g_(ik)]||.:}\left\|\begin{array}{cc} (4) \tag{21.42}\\ g_{00} & { }^{(4)} g_{0 k} \\ (4) g_{i 0} & { }^{(4)} g_{i k} \end{array}\right\|=\left\|\begin{array}{cc} \left(N_{s} N^{s}-N^{2}\right) & N_{k} \\ N_{i} & g_{i k} \end{array}\right\| .(21.42)(4)g00(4)g0k(4)gi0(4)gik=(NsNsN2)NkNigik.
The welded connectors do the job!
In (21.42), the quantities N m N m N^(m)N^{m}Nm are the components of the shift in its original primordial contravariant form, whereas the N i = g i m N m N i = g i m N m N_(i)=g_(im)N^(m)N_{i}=g_{i m} N^{m}Ni=gimNm are the covariant components, as calculated within the 3 -geometry with the 3 -metric. To invert this relation,
(21.43) N m = g m s N s (21.43) N m = g m s N s {:(21.43)N^(m)=g^(ms)N_(s):}\begin{equation*} N^{m}=g^{m s} N_{s} \tag{21.43} \end{equation*}(21.43)Nm=gmsNs
Metric of 4-geometry depends on lapse and shift of connectors of the two 3 -geometries
Details of the 4-geometry
is to deal with the reciprocal 3-metric, a quantity that has to be distinguished sharply from the reciprocal 4-metric. Thus, the reciprocal 4-metric is
(21.44) ( 4 ) g 00 ( 4 ) g 0 m ( 4 ) g k 0 ( 4 ) g k m = ( 1 / N 2 ) ( N m / N 2 ) ( N k / N 2 ) ( g k m N k N m / N 2 ) , (21.44) ( 4 ) g 00 ( 4 ) g 0 m ( 4 ) g k 0 ( 4 ) g k m = 1 / N 2 N m / N 2 N k / N 2 g k m N k N m / N 2 , {:(21.44)||[(4)],[g^(00),^((4))g^(0m)],[^((4))g^(k0),(4)g^(km)]||=||[-(1//N^(2)),(N^(m)//N^(2))],[(N^(k)//N^(2)),(g^(km)-N^(k)N^(m)//N^(2))]||",":}\left\|\begin{array}{cc} (4) \tag{21.44}\\ g^{00} & { }^{(4)} g^{0 m} \\ { }^{(4)} g^{k 0} & (4) g^{k m} \end{array}\right\|=\left\|\begin{array}{cc} -\left(1 / N^{2}\right) & \left(N^{m} / N^{2}\right) \\ \left(N^{k} / N^{2}\right) & \left(g^{k m}-N^{k} N^{m} / N^{2}\right) \end{array}\right\|,(21.44)(4)g00(4)g0m(4)gk0(4)gkm=(1/N2)(Nm/N2)(Nk/N2)(gkmNkNm/N2),
a result that one checks by calculating out the product
( 4 ) g α β ( 4 ) g β γ = ( 4 ) δ α γ ( 4 ) g α β ( 4 ) g β γ = ( 4 ) δ α γ ^((4))g_(alpha beta)^((4))g^(beta gamma)=^((4))delta_(alpha)^(gamma){ }^{(4)} g_{\alpha \beta}{ }^{(4)} g^{\beta \gamma}={ }^{(4)} \delta_{\alpha}{ }^{\gamma}(4)gαβ(4)gβγ=(4)δαγ
according to the standard rules for matrix multiplication.
The volume element has the form
(21.45) ( ( 4 ) g ) 1 / 2 d x 0 d x 1 d x 2 d x 3 = N g 1 / 2 d t d x 1 d x 2 d x 3 (21.45) ( 4 ) g 1 / 2 d x 0 d x 1 d x 2 d x 3 = N g 1 / 2 d t d x 1 d x 2 d x 3 {:(21.45)(-^((4))g)^(1//2)dx^(0)dx^(1)dx^(2)dx^(3)=Ng^(1//2)dtdx^(1)dx^(2)dx^(3):}\begin{equation*} \left(-{ }^{(4)} g\right)^{1 / 2} d x^{0} d x^{1} d x^{2} d x^{3}=N g^{1 / 2} d t d x^{1} d x^{2} d x^{3} \tag{21.45} \end{equation*}(21.45)((4)g)1/2dx0dx1dx2dx3=Ng1/2dtdx1dx2dx3
Welding the connectors to the two steel ribbons, or adding the lapse and shift functions to the 3 -metric, by rigidifying the 4 -metric, also automatically determines the components of the unit timelike normal vector n n n\boldsymbol{n}n. The condition of normalization on this 4 -vector is most easily formulated by saying that there exists a 1 -form, also called n n n\boldsymbol{n}n for the sake of convenience, dual to n n n\boldsymbol{n}n, and such that the product of this vector by this 1 -form has the value
(21.46) n , n = 1 . (21.46) n , n = 1 . {:(21.46)(:n","n:)=-1.:}\begin{equation*} \langle\boldsymbol{n}, \boldsymbol{n}\rangle=-1 . \tag{21.46} \end{equation*}(21.46)n,n=1.
This 1-form has the value
(21.47) n = n β d x β = N d t + 0 + 0 + 0 (21.47) n = n β d x β = N d t + 0 + 0 + 0 {:(21.47)n=n_(beta)dx^(beta)=-Ndt+0+0+0:}\begin{equation*} \boldsymbol{n}=n_{\beta} \boldsymbol{d} x^{\beta}=-N \boldsymbol{d} t+0+0+0 \tag{21.47} \end{equation*}(21.47)n=nβdxβ=Ndt+0+0+0
Only so can this 1-form, this structure of layered surfaces, automatically yield a value of unity, one bong of the bell, when pierced as in Figure 2.4 by a vector that represents an advance of one unit in proper time, regardless of what x , y x , y x,yx, yx,y, and z z zzz displacements it also has. Thus the unit timelike normal vector in covariant 1 -form representation necessarily has the components
(21.48) n β = ( N , 0 , 0 , 0 ) (21.48) n β = ( N , 0 , 0 , 0 ) {:(21.48)n_(beta)=(-N","0","0","0):}\begin{equation*} n_{\beta}=(-N, 0,0,0) \tag{21.48} \end{equation*}(21.48)nβ=(N,0,0,0)
Raise the indices via (21.44) to obtain the contravariant components of the same normal, represented as a tangent vector; thus,
(21.49) n α = [ ( 1 / N ) , ( N m / N ) ] . (21.49) n α = ( 1 / N ) , N m / N . {:(21.49)n^(alpha)=[(1//N),-(N^(m)//N)].:}\begin{equation*} n^{\alpha}=\left[(1 / N),-\left(N^{m} / N\right)\right] . \tag{21.49} \end{equation*}(21.49)nα=[(1/N),(Nm/N)].
This result receives a simple interpretation on inspection of Figure 21.2. Thus the typical "perpendicular connector" in the diagram can be said to have the components
( d t , N m d t ) d t , N m d t (dt,-N^(m)dt)\left(d t,-N^{m} d t\right)(dt,Nmdt)
and to have the proper length d τ = N d t d τ = N d t d tau=Ndtd \tau=N d tdτ=Ndt; so, ratioed down to a vector n n n\boldsymbol{n}n of unit proper length, the components are precisely those given by (21.49).

§21.5. INTRINSIC AND EXTRINSIC CURVATURE

The central concept in Einstein's account of gravity is curvature, so it is appropriate to analyze curvature in the language of the ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1)-space-time split. The curvature intrinsic to the 3-geometry of a spacelike hypersurface may be defined and calculated by the same methods described and employed in the calculation of four-dimensional curvature in Chapter 14. Of all measures of the intrinsic curvature, one of the simplest is the Riemann scalar curvature invariant ( 3 ) R ( 3 ) R ^((3))R{ }^{(3)} R(3)R (written for simplicity of notation in what follows without the prefix, as R R RRR ); and of all ways to define this invariant (see Chapter 14), one of the most compact uses the limit (see exercise 21.4)
(21.50) R ( at point under study ) = Lim ε 0 18 4 π ε 2 ( prespere defined as the locus of the a 2-sphere points at a proper distance ε 4 π ε 4 (21.50) R (  at point   under study  ) = Lim ε 0 18 4 π ε 2  prespere defined as the locus of the   a 2-sphere   points at a proper distance  ε 4 π ε 4 {:(21.50)R((" at point ")/(" under study "))=Lim_(epsi rarr0)18(4piepsi^(2)-([" prespere defined as the locus of the "],[" a 2-sphere "],[" points at a proper distance "epsi])/(4piepsi^(4)):}R\binom{\text { at point }}{\text { under study }}=\operatorname{Lim}_{\varepsilon \rightarrow 0} 18 \frac{4 \pi \varepsilon^{2}-\left(\begin{array}{l}\text { prespere defined as the locus of the } \tag{21.50}\\ \text { a 2-sphere } \\ \text { points at a proper distance } \varepsilon\end{array}\right.}{4 \pi \varepsilon^{4}}(21.50)R( at point  under study )=Limε0184πε2( prespere defined as the locus of the  a 2-sphere  points at a proper distance ε4πε4
For a more detailed description of the curvature intrinsic to the 3-geometry, capitalize on differential geometry as already developed in Chapters 8 through 14, amending it only as required to distinguish what is three-dimensional from what is four-dimensional. Begin by considering a displacement
(21.51) d P = e i d x i (21.51) d P = e i d x i {:(21.51)dP=e_(i)dx^(i):}\begin{equation*} \boldsymbol{d} \mathscr{P}=\boldsymbol{e}_{i} \boldsymbol{d} x^{i} \tag{21.51} \end{equation*}(21.51)dP=eidxi
within the hypersurface. Here the e i e i e_(i)\boldsymbol{e}_{i}ei are the basis tangent vectors e i = / x i e i = / x i e_(i)=del//delx^(i)\boldsymbol{e}_{i}=\partial / \partial x^{i}ei=/xi (in one notation) or e i = P / x i e i = P / x i e_(i)=delP//delx^(i)\boldsymbol{e}_{i}=\partial \mathscr{P} / \partial x^{i}ei=P/xi (in another notation) dual to the three coordinate 1 -forms d x i d x i dx^(i)\boldsymbol{d} x^{i}dxi. Any field of tangent vectors A A A\boldsymbol{A}A that happens to lie in the hypersurface lets itself be expressed in terms of the same basis vectors:
(21.52) A = e i A i (21.52) A = e i A i {:(21.52)A=e_(i)A^(i):}\begin{equation*} \boldsymbol{A}=\boldsymbol{e}_{i} A^{i} \tag{21.52} \end{equation*}(21.52)A=eiAi
The scalar product of this vector with the base vector e j e j e_(j)\boldsymbol{e}_{j}ej is
(21.53) ( A e j ) = A i ( e i e j ) = A i g i j = A j (21.53) A e j = A i e i e j = A i g i j = A j {:(21.53)(A*e_(j))=A^(i)(e_(i)*e_(j))=A^(i)g_(ij)=A_(j):}\begin{equation*} \left(\boldsymbol{A} \cdot \boldsymbol{e}_{j}\right)=A^{i}\left(\boldsymbol{e}_{i} \cdot \boldsymbol{e}_{j}\right)=A^{i} g_{i j}=A_{j} \tag{21.53} \end{equation*}(21.53)(Aej)=Ai(eiej)=Aigij=Aj
Now turn attention from a vector at one point to the parallel transport of the vector to a nearby point.
A vector lying on the equator of the Earth and pointing toward the North Star, transported parallel to itself along a meridian to a point still on the Earth's surface, but 1 , 000 km 1 , 000 km 1,000km1,000 \mathrm{~km}1,000 km to the north, will no longer lie in the 2 -geometry of the surface of the Earth. A telescope located in the northern hemisphere has to raise its tube to see the North Star! The generalization to a three-dimensional hypersurface imbedded in a 4-geometry is immediate. Take vector A A A\boldsymbol{A}A, lying in the hypersurface, and transport it along an elementary route lying in the hypersurface, and in the course of this transport displace it at each stage parallel to itself, where "parallel" means parallel with respect to the geometry of the enveloping 4 -manifold. Then A A A\boldsymbol{A}A will ordinarily
Scalar curvature as measure of area deficit
From parallel transport in 4-geometry to parallel transport in 3-geometry
A new covariant derivative, taken with respect to the 3 -geometry
end up no longer lying in the hypersurface. Thus the "covariant derivative" of A A A\boldsymbol{A}A in the direction of the i i iii-th coordinate direction in the geometry of the enveloping spacetime (that is, the A A A\boldsymbol{A}A at the new point diminished by the transported A A A\boldsymbol{A}A ) has the form (see §10.4)
(21.54) ( 4 ) e i A = ( 4 ) i A = ( 4 ) i ( e j A j ) = e j A j x i + ( ( 4 ) Γ j i μ e μ ) A j . (21.54) ( 4 ) e i A = ( 4 ) i A = ( 4 ) i e j A j = e j A j x i + ( 4 ) Γ j i μ e μ A j . {:(21.54)^((4))grad_(e_(i))A=^((4))grad_(i)A=^((4))grad_(i)(e_(j)A^(j))=e_(j)(delA^(j))/(delx^(i))+(^((4))Gamma_(ji)^(mu)e_(mu))A^(j).:}\begin{equation*} { }^{(4)} \nabla_{\boldsymbol{e}_{i}} \boldsymbol{A}={ }^{(4)} \nabla_{i} \boldsymbol{A}={ }^{(4)} \nabla_{i}\left(\boldsymbol{e}_{j} A^{j}\right)=\boldsymbol{e}_{j} \frac{\partial A^{j}}{\partial x^{i}}+\left({ }^{(4)} \Gamma_{j i}^{\mu} \boldsymbol{e}_{\mu}\right) A^{j} . \tag{21.54} \end{equation*}(21.54)(4)eiA=(4)iA=(4)i(ejAj)=ejAjxi+((4)Γjiμeμ)Aj.
A special instance of this formula is the equation for the covariantly measured change of the base vector e m e m e_(m)\boldsymbol{e}_{m}em itself,
(21.55) ( 4 ) i e m = ( 4 ) Γ m i μ e μ . (21.55) ( 4 ) i e m = ( 4 ) Γ m i μ e μ {:(21.55)^((4))grad_(i)e_(m)=^((4))Gamma_(mi)^(mu)e_(mu)". ":}\begin{equation*} { }^{(4)} \boldsymbol{\nabla}_{i} \boldsymbol{e}_{m}={ }^{(4)} \Gamma_{m i}^{\mu} \boldsymbol{e}_{\mu} \text {. } \tag{21.55} \end{equation*}(21.55)(4)iem=(4)Γmiμeμ
In both (21.54) and (21.55) the presence of the "out-of-the-hypersurface component"
(21.56) ( A j ( 4 ) Γ j i 0 ) ( e 0 n ) (21.56) A j ( 4 ) Γ j i 0 e 0 n {:(21.56)(A^(j(4))Gamma_(ji)^(0))(e_(0)*n):}\begin{equation*} \left(A^{j(4)} \Gamma_{j i}^{0}\right)\left(\boldsymbol{e}_{0} \cdot \boldsymbol{n}\right) \tag{21.56} \end{equation*}(21.56)(Aj(4)Γji0)(e0n)
is quite evident. Now kill this component. Project ( 4 ) A ( 4 ) A ^((4))grad A{ }^{(4)} \boldsymbol{\nabla} \boldsymbol{A}(4)A orthogonally onto the hypersurface. In this way arrive at a parallel transport and a covariant derivative that are intrinsic to the 3 -geometry of the hypersurface. By rights this covariant derivative should be written ( 3 ) ( 3 ) ^((3))grad{ }^{(3)} \nabla(3); but for simplicity of notation it will be written as grad\boldsymbol{\nabla} in the rest of this chapter, except where ambiguity might arise. To get the value of the new covariant derivative, one has only to rewrite (21.54) with the suffix ( 4 ) ( 4 ) ^((4)){ }^{(4)}(4) replaced everywhere by a ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3), or, better, dropped altogether and with the "dummy index" of summation μ = ( 0 , 1 , 2 , 3 ) μ = ( 0 , 1 , 2 , 3 ) mu=(0,1,2,3)\mu=(0,1,2,3)μ=(0,1,2,3) replaced by m = ( 1 , 2 , 3 ) m = ( 1 , 2 , 3 ) m=(1,2,3)m=(1,2,3)m=(1,2,3). However, it is more convenient, following Israel (1966), to turn from an expression dealing with contravariant components A i A i A^(i)A^{i}Ai of A A A\boldsymbol{A}A to one dealing with covariant components A i = ( A e i ) A i = A e i A_(i)=(A*e_(i))A_{i}=\left(\boldsymbol{A} \cdot \boldsymbol{e}_{i}\right)Ai=(Aei). Thus the covariant derivative of A A A\boldsymbol{A}A in the direction of the i i iii-th coordinate direction in the hypersurface, calculated with respect to the 3-geometry intrinsic to the hypersurface itself, has for its h h hhh-th covariant component the quantity [see equation (10.18)]
(21.57) A h i = e h ( 3 ) e i A e h i A = A h x i A m Γ m h i ( = A h ; i for A in Σ ) . (21.57) A h i = e h ( 3 ) e i A e h i A = A h x i A m Γ m h i = A h ; i  for  A  in  Σ . {:(21.57)A_(h∣i)=e_(h)*^((3))grad_(e_(i))A-=e_(h)*grad_(i)A=(delA_(h))/(delx^(i))-A^(m)Gamma_(mhi)(=A_(h;i)" for "A" in "Sigma).:}\begin{equation*} A_{h \mid i}=\boldsymbol{e}_{h} \cdot{ }^{(3)} \boldsymbol{\nabla}_{\boldsymbol{e}_{i}} \boldsymbol{A} \equiv \boldsymbol{e}_{h} \cdot \boldsymbol{\nabla}_{i} \boldsymbol{A}=\frac{\partial A_{h}}{\partial x^{i}}-A^{m} \Gamma_{m h i}\left(=A_{h ; i} \text { for } \boldsymbol{A} \text { in } \Sigma\right) . \tag{21.57} \end{equation*}(21.57)Ahi=eh(3)eiAehiA=AhxiAmΓmhi(=Ah;i for A in Σ).
Here the notation of the vertical stroke distinguishes this covariant derivative from the covariant derivative taken with respect to the 4 -geometry, as, for example, in equations ( 10.17 ff ). The connection coefficients here for three dimensions, like those dealt with earlier for four dimensions [see the equations leading from (14.14) through (14.15)], allow themselves to be expressed in terms of the metric coefficients and their first derivatives, and have the interpretation
(21.58) ( 3 ) Γ m h i Γ m h i = e m i e h . (21.58) ( 3 ) Γ m h i Γ m h i = e m i e h . {:(21.58)^((3))Gamma_(mhi)-=Gamma_(mhi)=e_(m)*grad_(i)e_(h).:}\begin{equation*} { }^{(3)} \Gamma_{m h i} \equiv \Gamma_{m h i}=\boldsymbol{e}_{m} \cdot \nabla_{i} \boldsymbol{e}_{h} . \tag{21.58} \end{equation*}(21.58)(3)ΓmhiΓmhi=emieh.
From the connection coefficients in turn, one calculates as in Chapter 14 the full Riemann curvature tensor ( 3 ) R i j m n ( 3 ) R i j m n ^((3))R^(i)_(jmn){ }^{(3)} R^{i}{ }_{j m n}(3)Rijmn of the 3-geometry intrinsic to the hypersurface.
Over and above the curvature intrinsic to the simultaneity, one now encounters a concept not covered in previous chapters (except fleetingly in Box 14.1), the extrinsic curvature of the 3 -geometry. This idea has no meaning for a 3-geometry
Figure 21.3.
Extrinsic curvature measures the fractional shrinkage and deformation of a figure lying in the spacelike hypersurface Σ Σ Sigma\SigmaΣ that takes place when each point in the figure is carried forward a unit interval of proper time "normal" to the hypersurface out into the enveloping spacetime. (No enveloping spacetime? No extrinsic curvature!) The extrinsic curvature tensor is a positive multiple of the unit tensor when elementary displacements δ P δ P deltaP\delta \mathscr{P}δP, in whatever direction within the surface they point, all experience the same fractional shrinkage. Thus the extrinsic curvature of the hypersurface illustrated in the figure is positive. The dashed arrow represents the normal vector n n n\boldsymbol{n}n at the fiducial point P P P\mathscr{P}P after parallel transport to the nearby point P + δ P P + δ P P+deltaP\mathscr{P}+\delta \mathscr{P}P+δP.
conceived in and by itself. It depends for its existence on this 3-geometry's being imbedded as a well-defined slice in a well-defined enveloping spacetime. It measures the curvature of this slice relative to that enveloping 4-geometry (Figure 21.3).
Take the normal that now stands at the point P P P\mathscr{P}P and, "keeping its base in the hypersurface" Σ Σ Sigma\SigmaΣ, transport it parallel to itself as a "fiducial vector" to the point P + δ P P + δ P P+deltaP\mathscr{P}+\delta \mathscr{P}P+δP, and there subtract it from the normal vector that already stands at that point. The difference, δ n δ n delta n\delta \boldsymbol{n}δn, may be regarded in the appropriate approximation as a "vector," the value of which is governed by and depends linearly on the "vector" of displacement δ P δ P deltaP\delta \mathscr{P}δP.
To obviate any appeal to the notion of approximation, go from the finite displacement δ P δ P deltaP\delta \mathscr{P}δP to the limiting concept of the vector-valued "displacement 1-form" d P d P dP\boldsymbol{d} \mathscr{P}dP [see equation 15.13]. Also replace the finite but not rigorously defined vector δ n δ n delta n\delta \boldsymbol{n}δn by the limiting concept of a vector-valued 1 -form d n d n dn\boldsymbol{d} \boldsymbol{n}dn. This quantity, regarded as a vector, being the change in a vector n n n\boldsymbol{n}n that does not change in length, must represent a change in direction and thus stand perpendicular to n n n\boldsymbol{n}n. Therefore it can be regarded as lying in the hypersurface Σ Σ Sigma\SigmaΣ. Depending linearly on d P d P dP\boldsymbol{d} \mathscr{P}dP, it can be represented in the form
(21.59) d n = K ( d P ) (21.59) d n = K ( d P ) {:(21.59)dn=-K(dP):}\begin{equation*} \boldsymbol{d} \boldsymbol{n}=-\boldsymbol{K}(\boldsymbol{d} \mathscr{P}) \tag{21.59} \end{equation*}(21.59)dn=K(dP)
Here the linear operator K K K\boldsymbol{K}K is the extrinsic curvature presented as an abstract coor-dinate-independent geometric object. The sign of K K K\boldsymbol{K}K as defined here is positive when the tips of the normals in Figure 21.3 are closer than their bases, as they are, for example, during the recontraction of a model universe, in agreement with the conventions employed by Eisenhart (1926), Schouten (1954), and Arnowitt, Deser and Misner (1962), but opposite to the convention of Israel (1966).
Into the slots in the 1 -forms that appear on the lefthand and righthand sides of (21.59), insert in place of the general tangent vector [which is to describe the general
Extrinsic curvature as an operator
local displacement, so far left open, as in the discussion following (2.12a)] a very special tangent vector, the basis vector e i e i e_(i)\boldsymbol{e}_{i}ei, for a displacement in the i i iii-th coordinate direction. Thus find (21.59) reading
(21.60) ( 4 ) i n = K ( e i ) = K i j e j , (21.60) ( 4 ) i n = K e i = K i j e j , {:(21.60)^((4))grad_(i)n=-K(e_(i))=-K_(i)^(j)e_(j)",":}\begin{equation*} { }^{(4)} \nabla_{i} \boldsymbol{n}=-\boldsymbol{K}\left(\boldsymbol{e}_{i}\right)=-K_{i}^{j} \boldsymbol{e}_{j}, \tag{21.60} \end{equation*}(21.60)(4)in=K(ei)=Kijej,
where the K i j K i j K_(i)^(j)K_{i}{ }^{j}Kij are the components of the linear operator K K K\boldsymbol{K}K in a coordinate representation. Take the scalar product of both sides of (21.60) with the basis vector e m e m e_(m)\boldsymbol{e}_{m}em. Recall ( e m n ) = 0 e m n = 0 (e_(m)*n)=0\left(\boldsymbol{e}_{m} \cdot \boldsymbol{n}\right)=0(emn)=0. Thus establish the symmetry of the tensor K i m K i m K_(im)K_{i m}Kim, covariantly presented, in its two indices:
K i m = K i j g i m = K i j ( e j e m ) = e m ( 4 ) i n = n ( 4 ) i e m (21.61) ϵ ¯ ( n e 0 ) ( 4 ) Γ m i 0 = n ( 4 ) m e i = K m i . [ see ( 21.55 ) ] K i m = K i j g i m = K i j e j e m = e m ( 4 ) i n = n ( 4 ) i e m (21.61) ϵ ¯ ¯ n e 0 ( 4 ) Γ m i 0 = n ( 4 ) m e i = K m i . [  see  ( 21.55 ) ] {:[K_(im)=K_(i)^(j)g_(im)=K_(i)^(j)(e_(j)*e_(m))=-e_(m)*^((4))grad_(i)n=n*^((4))grad_(i)e_(m)],[(21.61) bar(bar(epsilon))(n*e_(0))^((4))Gamma_(mi)^(0)=n*^((4))grad_(m)e_(i)=K_(mi).],[[" see "(21.55)]]:}\begin{align*} K_{i m} & =K_{i}^{j} g_{i m}=K_{i}^{j}\left(\boldsymbol{e}_{j} \cdot \boldsymbol{e}_{m}\right)=-\boldsymbol{e}_{m} \cdot{ }^{(4)} \nabla_{i} \boldsymbol{n}=\boldsymbol{n} \cdot{ }^{(4)} \nabla_{i} \boldsymbol{e}_{m} \\ & \overline{\bar{\epsilon}}\left(\boldsymbol{n} \cdot \boldsymbol{e}_{0}\right)^{(4)} \Gamma_{m i}^{0}=\boldsymbol{n} \cdot{ }^{(4)} \nabla_{m} \boldsymbol{e}_{i}=K_{m i} . \tag{21.61}\\ & {[\text { see }(21.55)] } \end{align*}Kim=Kijgim=Kij(ejem)=em(4)in=n(4)iem(21.61)ϵ¯(ne0)(4)Γmi0=n(4)mei=Kmi.[ see (21.55)]
A knowledge of the tensor K i j K i j K_(ij)K_{i j}Kij of extrinsic curvature assists in revealing the changes of the four vectors n , e 1 , e 2 , e 3 n , e 1 , e 2 , e 3 n,e_(1),e_(2),e_(3)\boldsymbol{n}, \boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}n,e1,e2,e3 under parallel transport. Equation (21.60) already tells how n n n\boldsymbol{n}n changes under parallel transport. The change of e m e m e_(m)\boldsymbol{e}_{m}em is to be read off from (21.55) as a vector. It is adequate identification of this vector to know its scalar product with each of four independent vectors: with the basis vectors e 1 , e 2 e 1 , e 2 e_(1),e_(2)\boldsymbol{e}_{1}, \boldsymbol{e}_{2}e1,e2, and e 3 e 3 e_(3)\boldsymbol{e}_{3}e3, or, more briefly, with e s e s e_(s)\boldsymbol{e}_{s}es, in (21.58); and with the normal vector n n n\boldsymbol{n}n in (21.61). Thus one arrives, following Israel (1966), at what are known as the equations of Gauss and Weingarten, in happy oversight of all change of notation in the intervening century:
(21.62) ( 4 ) i e j = K i j n n n + ( 3 ) Γ j i h e h . (21.62) ( 4 ) i e j = K i j n n n + ( 3 ) Γ j i h e h . {:(21.62)^((4))grad_(i)e_(j)=K_(ij)(n)/(n*n)+^((3))Gamma_(ji)^(h)e_(h).:}\begin{equation*} { }^{(4)} \boldsymbol{\nabla}_{i} \boldsymbol{e}_{j}=K_{i j} \frac{\boldsymbol{n}}{\boldsymbol{n} \cdot \boldsymbol{n}}+{ }^{(3)} \Gamma_{j i}^{h} \boldsymbol{e}_{h} . \tag{21.62} \end{equation*}(21.62)(4)iej=Kijnnn+(3)Γjiheh.
Knowing from this equation how each basis vector in Σ Σ Sigma\SigmaΣ changes, one also knows how to rewrite (21.54) for the change in any vector field A A A\boldsymbol{A}A that lies in Σ Σ Sigma\SigmaΣ. The change in both cases is expressed relative to a fiducial vector transported from a fiducial nearby point. By the term "parallel transport" one now means "parallel with respect to the geometry of the enveloping spacetime":
(21.63) ( 4 ) i A = A i j e j + K i j A j n ( n n ) (21.63) ( 4 ) i A = A i j e j + K i j A j n ( n n ) {:(21.63)^((4))grad_(i)A=A_(∣i)^(j)e_(j)+K_(ij)A^(j)(n)/((n*n)):}\begin{equation*} { }^{(4)} \boldsymbol{\nabla}_{i} \boldsymbol{A}=A_{\mid i}^{j} \boldsymbol{e}_{j}+K_{i j} A^{j} \frac{\boldsymbol{n}}{(\boldsymbol{n} \cdot \boldsymbol{n})} \tag{21.63} \end{equation*}(21.63)(4)iA=Aijej+KijAjn(nn)
Of special importance is the evaluation of extrinsic curvature when spacetime is sliced up into spacelike slices according to the plan of Arnowitt, Deser, and Misner as described in § 21.4 § 21.4 §21.4\S 21.4§21.4. The 4 -geometry of the thin sandwich illustrated in Figure 21.2 , rudimentary though it is, is fully defined by the 3 -metric on the two faces of the sandwich and by the lapse and shift functions N N NNN and N i N i N^(i)N^{i}Ni. The normal in covariant representation according to (21.47) has the components
(21.64) ( n 0 , n 1 , n 2 , n 3 ) = ( N , 0 , 0 , 0 ) (21.64) n 0 , n 1 , n 2 , n 3 = ( N , 0 , 0 , 0 ) {:(21.64)(n_(0),n_(1),n_(2),n_(3))=(-N","0","0","0):}\begin{equation*} \left(n_{0}, n_{1}, n_{2}, n_{3}\right)=(-N, 0,0,0) \tag{21.64} \end{equation*}(21.64)(n0,n1,n2,n3)=(N,0,0,0)
The change in n n n\boldsymbol{n}n relative to " n n n\boldsymbol{n}n transported parallel to itself in the enveloping 4 -geometry," according to the definition of parallel transport, is
( d n ) i = n i ; k d x k (21.65) = [ n i x k ( 4 ) Γ i k σ n σ ] d x k = N ( 4 ) Γ i k 0 d x k ( d n ) i = n i ; k d x k (21.65) = n i x k ( 4 ) Γ i k σ n σ d x k = N ( 4 ) Γ i k 0 d x k {:[(dn)_(i)=n_(i;k)dx^(k)],[(21.65)=[(deln_(i))/(delx^(k))-^((4))Gamma_(ik)^(sigma)n_(sigma)]dx^(k)],[=N^((4))Gamma_(ik)^(0)dx^(k)]:}\begin{align*} (\boldsymbol{d} \boldsymbol{n})_{i} & =n_{i ; k} \boldsymbol{d} x^{k} \\ & =\left[\frac{\partial n_{i}}{\partial x^{k}}-{ }^{(4)} \Gamma_{i k}^{\sigma} n_{\sigma}\right] \boldsymbol{d} x^{k} \tag{21.65}\\ & =N^{(4)} \Gamma_{i k}^{0} \boldsymbol{d} x^{k} \end{align*}(dn)i=ni;kdxk(21.65)=[nixk(4)Γikσnσ]dxk=N(4)Γik0dxk
Compare to the same change as expressed in terms of the extrinsic curvature tensor,
(21.66) ( d n ) i = K i k d x k (21.66) ( d n ) i = K i k d x k {:(21.66)(dn)_(i)=-K_(ik)dx^(k):}\begin{equation*} (\boldsymbol{d} \boldsymbol{n})_{i}=-K_{i k} \boldsymbol{d} x^{k} \tag{21.66} \end{equation*}(21.66)(dn)i=Kikdxk
Conclude that this tensor has the value
K i k = n i ; k = N ( 4 ) Γ i k 0 = N [ ( 4 ) g 00 ( 4 ) Γ 0 i k + ( 4 ) g 0 p ( 4 ) Γ p i k ] K i k = n i ; k = N ( 4 ) Γ i k 0 = N ( 4 ) g 00 ( 4 ) Γ 0 i k + ( 4 ) g 0 p ( 4 ) Γ p i k K_(ik)=-n_(i;k)=-N^((4))Gamma_(ik)^(0)=-N[^((4))g^(00(4))Gamma_(0ik)+^((4))g^(0p(4))Gamma_(pik)]K_{i k}=-n_{i ; k}=-N^{(4)} \Gamma_{i k}^{0}=-N\left[{ }^{(4)} g^{00(4)} \Gamma_{0 i k}+{ }^{(4)} g^{0 p(4)} \Gamma_{p i k}\right]Kik=ni;k=N(4)Γik0=N[(4)g00(4)Γ0ik+(4)g0p(4)Γpik]
or, with the help of equations (21.42) and (21.44),
K i k = ( 1 / N ) [ ( 4 ) Γ 0 i k N p ( 3 ) Γ p i k ] (21.67) = 1 2 N [ N i x k + N k x i g i k t 2 Γ p i k N p ] = 1 2 N [ N i k + N k i g i k t ] K i k = ( 1 / N ) ( 4 ) Γ 0 i k N p ( 3 ) Γ p i k (21.67) = 1 2 N N i x k + N k x i g i k t 2 Γ p i k N p = 1 2 N N i k + N k i g i k t {:[K_(ik)=(1//N)[^((4))Gamma_(0ik)-N^(p(3))Gamma_(pik)]],[(21.67)=(1)/(2N)[(delN_(i))/(delx^(k))+(delN_(k))/(delx^(i))-(delg_(ik))/(del t)-2Gamma_(pik)N^(p)]],[=(1)/(2N)[N_(i∣k)+N_(k∣i)-(delg_(ik))/(del t)]]:}\begin{align*} & K_{i k}=(1 / N)\left[{ }^{(4)} \Gamma_{0 i k}-N^{p(3)} \Gamma_{p i k}\right] \\ = & \frac{1}{2 N}\left[\frac{\partial N_{i}}{\partial x^{k}}+\frac{\partial N_{k}}{\partial x^{i}}-\frac{\partial g_{i k}}{\partial t}-2 \Gamma_{p i k} N^{p}\right] \tag{21.67}\\ = & \frac{1}{2 N}\left[N_{i \mid k}+N_{k \mid i}-\frac{\partial g_{i k}}{\partial t}\right] \end{align*}Kik=(1/N)[(4)Γ0ikNp(3)Γpik](21.67)=12N[Nixk+Nkxigikt2ΓpikNp]=12N[Nik+Nkigikt]
This is the extrinsic curvature expressed in terms of the ADM lapse and shift functions [Arnowitt, Deser, and Misner (1962)].
As an example, let Σ Σ Sigma\SigmaΣ have the geometry of a 3 -sphere
(21.68) d s 2 = a 2 [ d χ 2 + sin 2 χ ( d θ 2 + sin 2 θ d ϕ 2 ) ] (21.68) d s 2 = a 2 d χ 2 + sin 2 χ d θ 2 + sin 2 θ d ϕ 2 {:(21.68)ds^(2)=a^(2)[dchi^(2)+sin^(2)chi(dtheta^(2)+sin^(2)theta dphi^(2))]:}\begin{equation*} d s^{2}=a^{2}\left[d \chi^{2}+\sin ^{2} \chi\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] \tag{21.68} \end{equation*}(21.68)ds2=a2[dχ2+sin2χ(dθ2+sin2θdϕ2)]
Let the nearby spacelike slice in the one-parameter family of slices, the slice with the label t + d t t + d t t+dtt+d tt+dt (only a label!) have a 3 -metric given by the same formula with the radius a a aaa replaced by a + d a a + d a a+daa+d aa+da. The 4 -geometry of the thin sandwich between these two slices is completely undetermined until one gives the lapse and shift functions. For simplicity, take the shift vector N i N i N^(i)N^{i}Ni (see Figure 21.2) to be everywhere zero and the lapse function at every point on Σ Σ Sigma\SigmaΣ to have the same value N N NNN. The separation in proper time between the two spheres is thus d τ = N d t d τ = N d t d tau=Ndtd \tau=N d tdτ=Ndt. Any geometric figure located in Σ Σ Sigma\SigmaΣ expands with time. The fractional increase of any length in this figure per unit of proper time is the same in whatever direction that length is oriented, and has the value
(21.69) ( fractional increase of length per unit of proper time ) = 1 a d a d τ = 1 2 N 1 a 2 d ( a 2 ) d t (21.69)  fractional increase   of length per unit   of proper time  = 1 a d a d τ = 1 2 N 1 a 2 d a 2 d t {:(21.69)([" fractional increase "],[" of length per unit "],[" of proper time "])=(1)/(a)(da)/(d tau)=(1)/(2N)(1)/(a^(2))(d(a^(2)))/(dt):}\left(\begin{array}{l} \text { fractional increase } \tag{21.69}\\ \text { of length per unit } \\ \text { of proper time } \end{array}\right)=\frac{1}{a} \frac{d a}{d \tau}=\frac{1}{2 N} \frac{1}{a^{2}} \frac{d\left(a^{2}\right)}{d t}(21.69)( fractional increase  of length per unit  of proper time )=1adadτ=12N1a2d(a2)dt
The negative of this quantity, multiplied by the ( 1 1 ) ( 1 1 ) ((1)/(1))\binom{1}{1}(11) unit tensor, 1 = d P 1 = d P 1=dP\boldsymbol{1}=\boldsymbol{d} \mathscr{P}1=dP, gives the extrinsic curvature tensor in ( 1 1 ) ( 1 1 ) ((1)/(1))\binom{1}{1}(11) representation,
(21.70) K = 1 2 N 1 a 2 d ( a 2 ) d t 1 (21.70) K = 1 2 N 1 a 2 d a 2 d t 1 {:(21.70)K=-(1)/(2N)(1)/(a^(2))(d(a^(2)))/(dt)1:}\begin{equation*} \boldsymbol{K}=-\frac{1}{2 N} \frac{1}{a^{2}} \frac{d\left(a^{2}\right)}{d t} \mathbf{1} \tag{21.70} \end{equation*}(21.70)K=12N1a2d(a2)dt1
One confirms this result (exercise 21.5) by direct calculation of the components K i j K i j K_(i)^(j)K_{i}^{j}Kij using the ADM formula (21.67) as the starting point.
The Riemann curvature R a b c d = ( 3 ) R a b c d R a b c d = ( 3 ) R a b c d R^(a)_(bcd)=^((3))R^(a)_(bcd)R^{a}{ }_{b c d}={ }^{(3)} R^{a}{ }_{b c d}Rabcd=(3)Rabcd intrinsic to the hypersurface Σ Σ Sigma\SigmaΣ, together with the extrinsic curvature K i j K i j K_(ij)K_{i j}Kij, give one information on the Riemann and Einstein curvatures of the 4 -geometry. In the calculation, it is not convenient to use the coordinate basis,
basis vectors, basis 1 -forms
e 0 = t , d t , e i = i , d x i , e 0 = t , d t , e i = i , d x i , {:[e_(0)=del_(t)",",dt","],[e_(i)=del_(i)",",dx^(i)","]:}\begin{array}{lr} \boldsymbol{e}_{0}=\partial_{t}, & \boldsymbol{d} t, \\ \boldsymbol{e}_{i}=\partial_{i}, & \boldsymbol{d} x^{i}, \end{array}e0=t,dt,ei=i,dxi,
because ordinarily the basis vector e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 does not stand perpendicular to the hypersurface (see Figure 21.2). Adopt a different basis but one that is still self-dual,
(21.71) basis vectors, basis 1-forms, e n n = N 1 ( t N m m ) , w n = N d t = ( n n ) n e i = i , w i d x i + N i d t . (21.71)  basis vectors,   basis 1-forms,  e n n = N 1 t N m m , w n = N d t = ( n n ) n e i = i , w i d x i + N i d t . {:(21.71){:[" basis vectors, "," basis 1-forms, "],[e_(n)-=n=N^(-1)(del_(t)-N^(m)del_(m))",",w^(n)=Ndt=(n*n)n],[e_(i)=del_(i)",",w^(i)-=dx^(i)+N^(i)dt.]:}:}\begin{array}{cc} \text { basis vectors, } & \text { basis 1-forms, } \\ \boldsymbol{e}_{n} \equiv \boldsymbol{n}=N^{-1}\left(\partial_{t}-N^{m} \partial_{m}\right), & \boldsymbol{w}^{n}=N \boldsymbol{d} t=(\boldsymbol{n} \cdot \boldsymbol{n}) \boldsymbol{n} \tag{21.71}\\ \boldsymbol{e}_{i}=\partial_{i}, & \boldsymbol{w}^{i} \equiv \boldsymbol{d} x^{i}+N^{i} \boldsymbol{d} t . \end{array}(21.71) basis vectors,  basis 1-forms, enn=N1(tNmm),wn=Ndt=(nn)nei=i,widxi+Nidt.
Also use Greek labels α ¯ = n , 1 , 2 , 3 α ¯ = n , 1 , 2 , 3 bar(alpha)=n,1,2,3\bar{\alpha}=n, 1,2,3α¯=n,1,2,3, instead of Greek labels α = 0 , 1 , 2 , 3 α = 0 , 1 , 2 , 3 alpha=0,1,2,3\alpha=0,1,2,3α=0,1,2,3, to list components.
Recall that curvature is measured by the change in a vector on transport around a closed route; or, from equation (14.23),
(21.72) R ( u , v ) w = u v w v u w [ u , v ] w . (21.72) R ( u , v ) w = u v w v u w [ u , v ] w . {:(21.72)R(u","v)w=grad_(u)grad_(v)w-grad_(v)grad_(u)w-grad_([u,v])w.:}\begin{equation*} \mathscr{R}(u, v) w=\nabla_{u} \nabla_{v} w-\nabla_{v} \nabla_{u} w-\nabla_{[u, v]} w . \tag{21.72} \end{equation*}(21.72)R(u,v)w=uvwvuw[u,v]w.
Let the vector transported be e i e i e_(i)\boldsymbol{e}_{i}ei and let the route be defined by e j e j e_(j)\boldsymbol{e}_{j}ej and e k e k e_(k)\boldsymbol{e}_{k}ek. The latter two vectors belong to a coordinate basis. Therefore the "route closes automatically", [ e j , e k ] = 0 e j , e k = 0 [e_(j),e_(k)]=0\left[\boldsymbol{e}_{j}, \boldsymbol{e}_{k}\right]=0[ej,ek]=0, and the final term in (21.72) drops out of consideration. Call on (21.62) and (21.60) to find
( 4 ) e j ( 4 ) e k e i = ( 4 ) e j [ K i k n ( n n ) + ( 3 ) Γ i k m e m ] (21.73) = K i k , j n ( n n ) K i k K j m e m 1 ( n n ) + ( 3 ) Γ i k , j m e m + ( 3 ) Γ i k m [ K m j n ( n n ) + ( 3 ) Γ m j s e s ] . ( 4 ) e j ( 4 ) e k e i = ( 4 ) e j K i k n ( n n ) + ( 3 ) Γ i k m e m (21.73) = K i k , j n ( n n ) K i k K j m e m 1 ( n n ) + ( 3 ) Γ i k , j m e m + ( 3 ) Γ i k m K m j n ( n n ) + ( 3 ) Γ m j s e s . {:[^((4))grad_(e_(j))^((4))grad_(e_(k))e_(i)=^((4))grad_(e_(j))[K_(ik)(n)/((n*n))+^((3))Gamma_(ik)^(m)e_(m)]],[(21.73)=K_(ik,j)(n)/((n*n))-K_(ik)K_(j)^(m)e_(m)(1)/((n*n))+^((3))Gamma_(ik,j)^(m)e_(m)],[+^((3))Gamma_(ik)^(m)[K_(mj)(n)/((n*n))+^((3))Gamma_(mj)^(s)e_(s)].]:}\begin{align*} { }^{(4)} \boldsymbol{\nabla}_{\boldsymbol{e}_{j}}{ }^{(4)} \boldsymbol{\nabla}_{\boldsymbol{e}_{k}} \boldsymbol{e}_{i}= & { }^{(4)} \boldsymbol{\nabla}_{\boldsymbol{e}_{j}}\left[K_{i k} \frac{\boldsymbol{n}}{(\boldsymbol{n} \cdot \boldsymbol{n})}+{ }^{(3)} \Gamma_{i k}^{m} \boldsymbol{e}_{m}\right] \\ = & K_{i k, j} \frac{\boldsymbol{n}}{(\boldsymbol{n} \cdot \boldsymbol{n})}-K_{i k} K_{j}^{m} \boldsymbol{e}_{m} \frac{1}{(\boldsymbol{n} \cdot \boldsymbol{n})}+{ }^{(3)} \Gamma_{i k, j}^{m} \boldsymbol{e}_{m} \tag{21.73}\\ & +{ }^{(3)} \Gamma_{i k}^{m}\left[K_{m j} \frac{\boldsymbol{n}}{(\boldsymbol{n} \cdot \boldsymbol{n})}+{ }^{(3)} \Gamma_{m j}^{s} \boldsymbol{e}_{s}\right] . \end{align*}(4)ej(4)ekei=(4)ej[Kikn(nn)+(3)Γikmem](21.73)=Kik,jn(nn)KikKjmem1(nn)+(3)Γik,jmem+(3)Γikm[Kmjn(nn)+(3)Γmjses].
Evaluate similarly the term with indices j j jjj and k k kkk reversed, subtract from (21.73), simplify, and find
(21.74) R ( e j , e k ) e i = ( K i k j K i j k ) n ( n n ) + [ ( n n ) 1 ( K i j K k m K i k K j m ) + ( 3 ) R m i j k ] e m . (21.74) R e j , e k e i = K i k j K i j k n ( n n ) + ( n n ) 1 K i j K k m K i k K j m + ( 3 ) R m i j k e m . {:[(21.74)R(e_(j),e_(k))e_(i)=(K_(ik∣j)-K_(ij∣k))(n)/((n*n))],[+[(n*n)^(-1)(K_(ij)K_(k)^(m)-K_(ik)K_(j)^(m))+^((3))R^(m)_(ijk)]e_(m).]:}\begin{align*} \mathscr{R}\left(\boldsymbol{e}_{j}, \boldsymbol{e}_{k}\right) \boldsymbol{e}_{i}= & \left(K_{i k \mid j}-K_{i j \mid k}\right) \frac{\boldsymbol{n}}{(\boldsymbol{n} \cdot \boldsymbol{n})} \tag{21.74}\\ & +\left[(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left(K_{i j} K_{k}{ }^{m}-K_{i k} K_{j}^{m}\right)+{ }^{(3)} R^{m}{ }_{i j k}\right] \boldsymbol{e}_{m} . \end{align*}(21.74)R(ej,ek)ei=(KikjKijk)n(nn)+[(nn)1(KijKkmKikKjm)+(3)Rmijk]em.
The coefficients give directly the desired components of the curvature tensor
(21.75) ( 4 ) R m i j k = ( 3 ) R m i j k + ( n n ) 1 ( K i j K k m K i k K j m ) (21.75) ( 4 ) R m i j k = ( 3 ) R m i j k + ( n n ) 1 K i j K k m K i k K j m {:(21.75)^((4))R^(m)_(ijk)=^((3))R^(m)_(ijk)+(n*n)^(-1)(K_(ij)K_(k)^(m)-K_(ik)K_(j)^(m)):}\begin{equation*} { }^{(4)} R^{m}{ }_{i j k}={ }^{(3)} R^{m}{ }_{i j k}+(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left(K_{i j} K_{k}^{m}-K_{i k} K_{j}^{m}\right) \tag{21.75} \end{equation*}(21.75)(4)Rmijk=(3)Rmijk+(nn)1(KijKkmKikKjm)
and
(21.76) ( 4 ) R n i j k = ( n n ) 1 ( 4 ) R n i j k = ( n n ) 1 ( K i j k K i k j ) . (21.76) ( 4 ) R n i j k = ( n n ) 1 ( 4 ) R n i j k = ( n n ) 1 K i j k K i k j . {:(21.76)^((4))R^(n)_(ijk)=(n*n)^(-1(4))R_(nijk)=-(n*n)^(-1)(K_(ij∣k)-K_(ik∣j)).:}\begin{equation*} { }^{(4)} R^{n}{ }_{i j k}=(\boldsymbol{n} \cdot \boldsymbol{n})^{-1(4)} R_{n i j k}=-(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left(K_{i j \mid k}-K_{i k \mid j}\right) . \tag{21.76} \end{equation*}(21.76)(4)Rnijk=(nn)1(4)Rnijk=(nn)1(KijkKikj).
Equations (21.75) and (21.76) are known as the equations of Gauss and Codazzi [for literature, see Eisenhart (1926)]. It follows from (21.75) that the components of the curvature of the 3 -geometry will normally only then agree with the corresponding components of the curvature of the 4 -geometry when the imbedding happens to be accomplished at the point under study with a hypersurface free of extrinsic curvature. The directly opposite situation is illustrated by the familiar example of a 2 -sphere imbedded in a flat 3 -space, where the lefthand side of (21.75) (with dimensions lowered by one unit throughout!) is zero, and the extrinsic and intrinsic curvature on the right exactly cancel.
Important components of the Einstein curvature let themselves be evaluated from the Gauss-Codazzi results. In doing the calculation, it is simplest to think of e i , e j e i , e j e_(i),e_(j)\boldsymbol{e}_{i}, \boldsymbol{e}_{j}ei,ej and e k e k e_(k)\boldsymbol{e}_{k}ek as being an orthonormal tetrad, n n n\boldsymbol{n}n being itself already normalized and orthogonal to every vector in the hypersurface. Then, employing (14.7) and (21.75), one finds
G 0 0 = ( 4 ) R 12 12 + ( 4 ) R 23 23 + ( 4 ) R 31 31 = ( 3 ) R 12 12 + ( 3 ) R 23 23 + ( 3 ) R 31 31 (21.77) + ( n n ) 1 [ ( K 1 2 K 2 1 K 2 2 K 1 1 ) + ( K 2 3 K 3 2 K 3 3 K 2 2 ) + ( K 3 1 K 1 3 K 1 1 K 3 3 ) ] = 1 2 R 1 2 ( n n ) 1 [ ( Tr K ) 2 Tr ( K 2 ) ] . G 0 0 = ( 4 ) R 12 12 + ( 4 ) R 23 23 + ( 4 ) R 31 31 = ( 3 ) R 12 12 + ( 3 ) R 23 23 + ( 3 ) R 31 31 (21.77) + ( n n ) 1 K 1 2 K 2 1 K 2 2 K 1 1 + K 2 3 K 3 2 K 3 3 K 2 2 + K 3 1 K 1 3 K 1 1 K 3 3 = 1 2 R 1 2 ( n n ) 1 ( Tr K ) 2 Tr K 2 . {:[-G_(0)^(0)=^((4))R^(12)_(12)+^((4))R^(23)_(23)+^((4))R^(31)_(31)],[=^((3))R^(12)_(12)+^((3))R^(23)_(23)+^((3))R^(31)_(31)],[(21.77)+(n*n)^(-1)[(K_(1)^(2)K_(2)^(1)-K_(2)^(2)K_(1)^(1))+(K_(2)^(3)K_(3)^(2)-K_(3)^(3)K_(2)^(2)):}],[{:+(K_(3)^(1)K_(1)^(3)-K_(1)^(1)K_(3)^(3))]],[=(1)/(2)R-(1)/(2)(n*n)^(-1)[(Tr K)^(2)-Tr(K^(2))].]:}\begin{align*} -G_{0}^{0}= & { }^{(4)} R^{12}{ }_{12}+{ }^{(4)} R^{23}{ }_{23}+{ }^{(4)} R^{31}{ }_{31} \\ = & { }^{(3)} R^{12}{ }_{12}+{ }^{(3)} R^{23}{ }_{23}+{ }^{(3)} R^{31}{ }_{31} \\ & +(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left[\left(K_{1}^{2} K_{2}^{1}-K_{2}^{2} K_{1}^{1}\right)+\left(K_{2}^{3} K_{3}^{2}-K_{3}^{3} K_{2}^{2}\right)\right. \tag{21.77}\\ & \left.+\left(K_{3}^{1} K_{1}^{3}-K_{1}^{1} K_{3}^{3}\right)\right] \\ = & \frac{1}{2} R-\frac{1}{2}(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left[(\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr}\left(\boldsymbol{K}^{2}\right)\right] . \end{align*}G00=(4)R1212+(4)R2323+(4)R3131=(3)R1212+(3)R2323+(3)R3131(21.77)+(nn)1[(K12K21K22K11)+(K23K32K33K22)+(K31K13K11K33)]=12R12(nn)1[(TrK)2Tr(K2)].
Here R R RRR is the 3-dimensional scalar curvature invariant and Tr stands for "trace of"; thus,
(21.78) Tr κ = g i j K i j = g i j K i j = K j j (21.78) Tr κ = g i j K i j = g i j K i j = K j j {:(21.78)Tr kappa=g^(ij)K_(ij)=g_(ij)K^(ij)=K_(j)^(j):}\begin{equation*} \operatorname{Tr} \boldsymbol{\kappa}=g^{i j} K_{i j}=g_{i j} K^{i j}=K_{j}^{j} \tag{21.78} \end{equation*}(21.78)Trκ=gijKij=gijKij=Kjj
and
(21.79) Tr K 2 = ( K 2 ) j j = K j m K m j = g j s K s m g m i K i j (21.79) Tr K 2 = K 2 j j = K j m K m j = g j s K s m g m i K i j {:(21.79)TrK^(2)=(K^(2))_(j)^(j)=K_(j)^(m)K_(m)^(j)=g_(js)K^(sm)g_(mi)K^(ij):}\begin{equation*} \operatorname{Tr} \boldsymbol{K}^{2}=\left(K^{2}\right)_{j}^{j}=K_{j}^{m} K_{m}{ }^{j}=g_{j s} K^{s m} g_{m i} K^{i j} \tag{21.79} \end{equation*}(21.79)TrK2=(K2)jj=KjmKmj=gjsKsmgmiKij
The result, though obtained in an orthonormal tetrad, plainly is covariant with respect to general coordinate transformations within the spacelike hypersurface; and it makes no explicit reference whatever to any time coordinate, in this respect providing a coordinate-free description of the Einstein curvature.
The Einstein field equation equates (21.77) to 8 π ρ 8 π ρ 8pi rho8 \pi \rho8πρ, where ρ ρ rho\rhoρ is the density of mass-energy. Expression (21.77) is the "measure of curvature that is independent of how curved one cuts a spacelike slice." This measure of curvature is central to the derivation of Einstein's field equation that is sketched in Box 17.2, item 3, "Physics on a Spacelike Slice."
The other component of the Einstein curvature tensor that is easily evaluated by (14.7) from the results at hand has the form
(21.80) G 1 n = ( 4 ) R n 2 12 + ( 4 ) R n 3 13 = ( n n ) 1 ( K 1 2 2 K 2 1 2 + K 1 3 3 K 3 1 3 ) , (21.80) G 1 n = ( 4 ) R n 2 12 + ( 4 ) R n 3 13 = ( n n ) 1 K 1 2 2 K 2 1 2 + K 1 3 3 K 3 1 3 , {:[(21.80)G_(1)^(n)=^((4))R^(n2)_(12)+^((4))R^(n3)_(13)],[=-(n*n)^(-1)(K_(1∣2)^(2)-K_(2∣1)^(2)+K_(1∣3)^(3)-K_(3∣1)^(3))","]:}\begin{align*} G_{1}^{n} & ={ }^{(4)} R^{n 2}{ }_{12}+{ }^{(4)} R^{n 3}{ }_{13} \tag{21.80}\\ & =-(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left(K_{1 \mid 2}^{2}-K_{2 \mid 1}^{2}+K_{1 \mid 3}^{3}-K_{3 \mid 1}^{3}\right), \end{align*}(21.80)G1n=(4)Rn212+(4)Rn313=(nn)1(K122K212+K133K313),
when referred to an orthonormal frame. One immediately translates to a form valid for any frame e 1 , e 2 , e 3 e 1 , e 2 , e 3 e_(1),e_(2),e_(3)\boldsymbol{e}_{1}, \boldsymbol{e}_{2}, \boldsymbol{e}_{3}e1,e2,e3 in the hypersurface, orthonormal or not,
(21.81) G i n = ( n n ) 1 [ K i m m ( Tr K ) i ] . (21.81) G i n = ( n n ) 1 K i m m ( Tr K ) i . {:(21.81)G_(i)^(n)=-(n*n)^(-1)[K_(i∣m)^(m)-(Tr K)_(∣i)].:}\begin{equation*} G_{i}^{n}=-(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left[K_{i \mid m}^{m}-(\operatorname{Tr} \boldsymbol{K})_{\mid i}\right] . \tag{21.81} \end{equation*}(21.81)Gin=(nn)1[Kimm(TrK)i].
Einstein curvature in terms of extrinsic curvature
Equation (21.77) is the central Einstein equation, "mass-energy fixes curvature"
The other initial-value equation
The Einstein field equation equates this quantity to 8 π 8 π 8pi8 \pi8π times the i i iii-th covariant component of the density of momentum carried by matter and fields other than gravity.
The four components of the Einstein field equation so far written down will have a central place in what follows as "initial-value equations" of general relativity. The other six components will not be written out: (1) the dynamics lets itself be analyzed more simply by Hamiltonian methods; and (2) the calculation takes work. It demands that one evaluate the remaining type of object, R ( e j , n ) e i R e j , n e i R(e_(j),n)e_(i)\mathscr{R}\left(\boldsymbol{e}_{j}, \boldsymbol{n}\right) \boldsymbol{e}_{i}R(ej,n)ei. One step towards that calculation will be found in exercise 21.7. Sachs does the calculation (1964, equation 10) but only after specializing to Gaussian normal coordinates. These coordinates presuppose a very special slicing of spacetime: (1) geodesics issuing normally from the spacelike hypersurface n = 0 n = 0 n=0n=0n=0 cut all subsequent simultaneities n = n = n=n=n= constant normally; and (2) the n n nnn coordinate directiy measures lapse of proper time, or proper length, whichever is appropriate,* along these geodesics. In coordinates so special it is not surprising that the answer looks simple:
(21.82) ( 4 ) R n i n k = ( n n ) 1 ( K i k n + K i m K k m ) . ( Gaussian normal coordinates ) (21.82) ( 4 ) R n i n k = ( n n ) 1 K i k n + K i m K k m . (  Gaussian normal   coordinates  ) {:(21.82)^((4))R^(n)_(ink)=(n*n)^(-1)((delK_(ik))/(del n)+K_(im)K_(k)^(m)).quad((" Gaussian normal ")/(" coordinates ")):}\begin{equation*} { }^{(4)} R^{n}{ }_{i n k}=(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left(\frac{\partial K_{i k}}{\partial n}+K_{i m} K_{k}^{m}\right) . \quad\binom{\text { Gaussian normal }}{\text { coordinates }} \tag{21.82} \end{equation*}(21.82)(4)Rnink=(nn)1(Kikn+KimKkm).( Gaussian normal  coordinates )
Additional terms come into (21.82) when one uses, instead of the Gaussian normal coordinate system, the coordinate system of Arnowitt, Deser, and Misner. The ADM coordinates are employed here because they allow one to analyze the dynamics as one wants to analyze the dynamics, with freedom to push the spacelike hypersurface ahead in time at different rates in different places ("many-fingered time"). Fischer (1971) shows how to evaluate and understand the geometric content of such formulas in a coordinate-free way by using the concept of Lie derivative of a tensor field, an introduction to which is provided by exercise 21.8.

EXERCISES

Exercise 21.4. sCALAR CURVATURE INVARIANT IN TERMS OF AREA DEFICIT

It being 10 , 000 km 10 , 000 km 10,000km10,000 \mathrm{~km}10,000 km from North Pole to equator, one would have 62 , 832 km 62 , 832 km 62,832km62,832 \mathrm{~km}62,832 km for the length of the "equator" if the earth were flat, as contrasted to the actual 40 , 000 km 40 , 000 km ∼40,000km\sim 40,000 \mathrm{~km}40,000 km, a difference reflecting the fact that the surface is curved up into closure. Turn from this "pre-problem" to the actual problem, a 3 -sphere
d s 2 = a 2 [ d χ 2 + sin 2 χ ( d θ 2 + sin 2 θ d ϕ 2 ) ] . d s 2 = a 2 d χ 2 + sin 2 χ d θ 2 + sin 2 θ d ϕ 2 . ds^(2)=a^(2)[dchi^(2)+sin^(2)chi(dtheta^(2)+sin^(2)theta dphi^(2))].d s^{2}=a^{2}\left[d \chi^{2}+\sin ^{2} \chi\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] .ds2=a2[dχ2+sin2χ(dθ2+sin2θdϕ2)].
Measure off from χ = 0 χ = 0 chi=0\chi=0χ=0 a 2 -sphere of proper radius ε = a χ ε = a χ epsi=a_(chi)\varepsilon=a_{\chi}ε=aχ. Determine the proper area of this 2 -sphere as a function of χ χ chi\chiχ. Verify that relation (21.50) on the area deficit gives in the limit ε 0 ε 0 epsi longrightarrow0\varepsilon \longrightarrow 0ε0 the correct result R = 6 / a 2 R = 6 / a 2 R=6//a^(2)R=6 / a^{2}R=6/a2. For a more ambitious exercise: (1) take a general (smooth) 3-geometry; (2) express the metric near any chosen point in terms of Riemann's normal coordinates as given in § 11.6 § 11.6 §11.6\S 11.6§11.6; (3) determine the locus of the set of points at the proper distance ε ε epsi\varepsilonε to the lowest interesting power of ε ε epsi\varepsilonε in terms of the spherical polar angles θ θ theta\thetaθ and ϕ ϕ phi\phiϕ (direction of start of geodesic of length ε ε epsi\varepsilonε ); (4) determine to the lowest interesting power of ε ε epsi\varepsilonε the proper area of the figure defined by these points; and thereby establish (21.50) [for more on this topic see, for example, Cartan (1946), pp. 252-256].

Exercise 21.5. EXTRINSIC CURVATURE TENSOR FOR SLICE OF FRIEDMANN GEOMETRY

Confirm the result (21.70) for the extrinsic curvature by direct calculation from formula (21.67).

Exercise 21.6. EVALUATION OF R ( e j , e k ) n R e j , e k n R(e_(j),e_(k))n\mathscr{R}\left(\boldsymbol{e}_{j}, \boldsymbol{e}_{k}\right) \boldsymbol{n}R(ej,ek)n

Evaluate this quantity along the model of (21.74) or otherwise. How can it be foreseen that the coefficient of n n n\boldsymbol{n}n in the result must vanish identically? Comparing coefficients of e m e m e_(m)\boldsymbol{e}_{m}em, find ( 4 ) R m n j k ( 4 ) R m n j k ^((4))R^(m)_(njk){ }^{(4)} R^{m}{ }_{n j k}(4)Rmnjk and test for equivalence to equation (21.76).

Exercise 21.7. EVALUATION OF THE COMMUTATOR [ e j , n e j , n e_(j),ne_{j}, nej,n ]

The evaluation of this commutator is a first step toward the calculation of a quantity like R ( e j , n ) e i R e j , n e i R(e_(j),n)e_(i)\mathscr{R}\left(\boldsymbol{e}_{j}, \boldsymbol{n}\right) \boldsymbol{e}_{i}R(ej,n)ei. Expressing e j e j e_(j)\boldsymbol{e}_{j}ej as the differential operator / x j / x j del//delx^(j)\partial / \partial x^{j}/xj, use (21.49) to represent n n n\boldsymbol{n}n also as a differential operator. In this way, show that the commutator in question has the value ( N , j / N ) n ( N m , j / N ) e m N , j / N n N m , j / N e m -(N_(,j)//N)n-(N^(m)_(,j)//N)e_(m)-\left(N_{, j} / N\right) \boldsymbol{n}-\left(N^{m}{ }_{, j} / N\right) \boldsymbol{e}_{m}(N,j/N)n(Nm,j/N)em.
Exercise 21.8. LIE DERIVATIVE OF A TENSOR (exercise provided by J. W. York, Jr.)
Define the Lie derivative of a tensor field and explore some of its properties. The Lie derivative along a vector field n n n\boldsymbol{n}n is a differential operator that operates on tensor fields T T T\boldsymbol{T}T of type ( r s ) ( r s ) ((r)/(s))\binom{r}{s}(rs), converting them into tensors n T n T छ_(n)T\boldsymbol{छ}_{n} \boldsymbol{T}nT, also of type ( r s ) ( r s ) ((r)/(s))\binom{r}{s}(rs). The Lie differentiation process obeys the usual chain rule and has additivity properties [compare equations ( 10.2 b , 10.2 c 10.2 b , 10.2 c 10.2b,10.2c10.2 \mathrm{~b}, 10.2 \mathrm{c}10.2 b,10.2c, 10.2 d ) for the covariant derivative]. For scalar functions f f fff, one has Z n f n [ f ] = f , μ n μ Z n f n [ f ] = f , μ n μ Z_(n)f-=n[f]=f_(,mu)n^(mu)\boldsymbol{\mathcal { Z }}_{\boldsymbol{n}} f \equiv \boldsymbol{n}[f]=f_{, \mu} n^{\mu}Znfn[f]=f,μnμ. The Lie derivative of a vector field u u u\boldsymbol{u}u along a vector field v v v\boldsymbol{v}v was defined in exercise 9.11 by
z u v [ u , v ] z u v [ u , v ] z_(u)v-=[u,v]\mathfrak{z}_{u} \boldsymbol{v} \equiv[\boldsymbol{u}, \boldsymbol{v}]zuv[u,v]
If the action of z n z n z_(n)\mathfrak{z}_{n}zn on 1 -forms is defined, the extension to tensors of general type will be simple, because the latter can always be decomposed into a sum of tensor products of vectors and 1 -forms. If σ σ sigma\boldsymbol{\sigma}σ is a 1 -form and v v v\boldsymbol{v}v is a vector, then one defines z n σ z n σ z_(n)sigma\boldsymbol{z}_{\boldsymbol{n}} \boldsymbol{\sigma}znσ to be that 1 -form satisfying
Z n σ , v = n [ σ , v ] σ , [ n , v ] Z n σ , v = n [ σ , v ] σ , [ n , v ] (:Z_(n)sigma,v:)=n[(:sigma,v:)]-(:sigma,[n,v]:)\left\langle\mathfrak{Z}_{n} \sigma, v\right\rangle=\boldsymbol{n}[\langle\boldsymbol{\sigma}, \boldsymbol{v}\rangle]-\langle\boldsymbol{\sigma},[\boldsymbol{n}, \boldsymbol{v}]\rangleZnσ,v=n[σ,v]σ,[n,v]
for arbitrary v v v\boldsymbol{v}v.
(a) Show that in a coordinate basis
E n σ = ( σ α , β n β + σ β n , α β ) d x α . E n σ = σ α , β n β + σ β n , α β d x α . E_(n)sigma=(sigma_(alpha,beta)n^(beta)+sigma_(beta)n_(,alpha)^(beta))dx^(alpha).\mathfrak{E}_{n} \sigma=\left(\sigma_{\alpha, \beta} n^{\beta}+\sigma_{\beta} n_{, \alpha}^{\beta}\right) \boldsymbol{d} x^{\alpha} .Enσ=(σα,βnβ+σβn,αβ)dxα.
(b) Show that in a coordinate basis
Z n T = ( T α β , μ n μ + T μ β n μ , α + T α μ n μ , β ) d x α d x β Z n T = T α β , μ n μ + T μ β n μ , α + T α μ n μ , β d x α d x β Z_(n)T=(T_(alpha beta,mu)n^(mu)+T_(mu beta)n^(mu)_(,alpha)+T_(alpha mu)n^(mu)_(,beta))dx^(alpha)ox dx^(beta)\mathfrak{Z}_{\boldsymbol{n}} \boldsymbol{T}=\left(T_{\alpha \beta, \mu} n^{\mu}+T_{\mu \beta} n^{\mu}{ }_{, \alpha}+T_{\alpha \mu} n^{\mu}{ }_{, \beta}\right) \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}ZnT=(Tαβ,μnμ+Tμβnμ,α+Tαμnμ,β)dxαdxβ
where T T T\boldsymbol{T}T is of type ( 0 2 ) ( 0 2 ) ((0)/(2))\binom{0}{2}(02).
(c) Show that in (a) and (b), all partial derivatives can be replaced by covariant derivatives. [Observe that Lie differentiation is defined independently of the existence of an affine connection. For more information, see, for example, Bishop and Goldberg (1968) and Schouten (1954)].

Exercise 21.9. EXPRESSION FOR DYNAMIC COMPONENTS OF THE

CURVATURE TENSOR (exercise provided by J. W. York, Jr.)
The Gauss-Codazzi equations can be viewed as giving 14 of the 20 algebraically independent components of the spacetime curvature tensor in terms of the intrinsic and extrinsic geometry of three-dimensional (non-null) hypersurfaces. In order to accomplish a space-plus-time splitting of the Hilbert Lagrangian g ( 4 ) R g ( 4 ) R sqrt(-g^((4)))R\sqrt{-g^{(4)}} Rg(4)R, one must express, in addition, the remaining
6 components of the curvature tensor in an analogous manner. It is convenient for this purpose to express all tensors as spacetime tensors, and to use Lie derivation in the direction of the timelike unit normal field of the spacelike hypersurfaces as a generalized notion of time differentiation. A number of preliminary results must be proven:
(a) Z u g μ v = u μ ; ν + u v ; μ Z u ( g μ ν + u μ u ν ) Z u ( γ μ ν ) = u μ ; ν + u v ; μ + u μ a v + a μ u v (a) Z u g μ v = u μ ; ν + u v ; μ Z u g μ ν + u μ u ν Z u γ μ ν = u μ ; ν + u v ; μ + u μ a v + a μ u v {:[(a)Z_(u)g_(mu v)=u_(mu;nu)+u_(v;mu)],[Z_(u)(g_(mu nu)+u_(mu)u_(nu))-=Z_(u)(gamma_(mu nu))],[=u_(mu;nu)+u_(v;mu)+u_(mu)a_(v)+a_(mu)u_(v)]:}\begin{align*} \mathscr{Z}_{u} g_{\mu v} & =u_{\mu ; \nu}+u_{v ; \mu} \tag{a}\\ \mathscr{Z}_{u}\left(g_{\mu \nu}+u_{\mu} u_{\nu}\right) & \equiv \mathscr{\mathscr { Z }}_{u}\left(\gamma_{\mu \nu}\right) \\ & =u_{\mu ; \nu}+u_{v ; \mu}+u_{\mu} a_{v}+a_{\mu} u_{v} \end{align*}(a)Zugμv=uμ;ν+uv;μZu(gμν+uμuν)Zu(γμν)=uμ;ν+uv;μ+uμav+aμuv
where γ μ ν γ μ ν gamma_(mu nu)\gamma_{\mu \nu}γμν is the metric of the spacelike hypersurface, expressed in the spacetime coordinate basis, and a μ u λ λ u μ a μ u λ λ u μ a^(mu)-=u^(lambda)grad_(lambda)u^(mu)a^{\mu} \equiv u^{\lambda} \nabla_{\lambda} u^{\mu}aμuλλuμ is the curvature vector (4-acceleration) of the timelike normal curves whose tangent field is u μ u μ u^(mu)u^{\mu}uμ. (Recall that u μ a μ = 0 u μ a μ = 0 u_(mu)a^(mu)=0u_{\mu} a^{\mu}=0uμaμ=0.)
(c) Prove that the extrinsic curvature tensor is given by
K μ ν = 1 2 z μ γ μ ν . K μ ν = 1 2 z μ γ μ ν . K_(mu nu)=-(1)/(2)z_(mu)gamma_(mu nu).K_{\mu \nu}=-\frac{1}{2} \mathscr{z}_{\mu} \gamma_{\mu \nu} .Kμν=12zμγμν.
(d) The unit tensor of projection into the hypersurface is defined by
ν μ δ v μ + u μ u v ν μ δ v μ + u μ u v _|__(nu)^(mu)-=delta_(v)^(mu)+u^(mu)u_(v)\perp_{\nu}^{\mu} \equiv \delta_{v}^{\mu}+u^{\mu} u_{v}νμδvμ+uμuv
In terms of _|_\perp show that one can write
u α ; β K α β ω α β a α u β , u α ; β K α β ω α β a α u β , u_(alpha;beta)-=-K_(alpha beta)-omega_(alpha beta)-a_(alpha)u_(beta),u_{\alpha ; \beta} \equiv-K_{\alpha \beta}-\omega_{\alpha \beta}-a_{\alpha} u_{\beta},uα;βKαβωαβaαuβ,
where
K α β = α μ β p u ( μ ; p ) K α β = α μ β p u ( μ ; p ) K_(alpha beta)=-_|__(alpha)^(mu)_|__(beta)^(p)u_((mu;p))K_{\alpha \beta}=-\perp_{\alpha}^{\mu} \perp_{\beta}^{p} u_{(\mu ; p)}Kαβ=αμβpu(μ;p)
and
ω α β = α μ β μ u [ μ ; j ] . ω α β = α μ β μ u [ μ ; j ] . omega_(alpha beta)=-_|__(alpha)^(mu)_|__(beta)^(mu)u_([mu;j]).\omega_{\alpha \beta}=-\perp_{\alpha}^{\mu} \perp_{\beta}^{\mu} u_{[\mu ; j]} .ωαβ=αμβμu[μ;j].
(e) From the fact that u μ u μ u^(mu)u^{\mu}uμ is the unit normal field for a family of spacelike hypersurfaces, show that ω α β = 0 ω α β = 0 omega_(alpha beta)=0\omega_{\alpha \beta}=0ωαβ=0.
(f) The needed tools are now on hand. To obtain the result:
(i) Write down E u K μ ν E u K μ ν E_(u)K_(mu nu)\mathscr{E}_{u} K_{\mu \nu}EuKμν (see exercise 21.8);
(ii) Insert this expression into the Ricci identity in the form
u σ σ μ u v = u σ μ σ u v + ( 4 ) R ρ ν μ σ u σ u ρ ; u σ σ μ u v = u σ μ σ u v + ( 4 ) R ρ ν μ σ u σ u ρ ; u^(sigma)grad_(sigma)grad_(mu)u_(v)=u^(sigma)grad_(mu)grad_(sigma)u_(v)+^((4))R_(rho nu mu sigma)u^(sigma)u^(rho);u^{\sigma} \nabla_{\sigma} \nabla_{\mu} u_{v}=u^{\sigma} \nabla_{\mu} \nabla_{\sigma} u_{v}+{ }^{(4)} R_{\rho \nu \mu \sigma} u^{\sigma} u^{\rho} ;uσσμuv=uσμσuv+(4)Rρνμσuσuρ;
(iii) Project the two remaining free indices into the hypersurface using _|_\perp, and show that one obtains
α μ β ρ β ( 4 ) R μ ν ρ σ u ν u σ = z u K α β + K α γ K γ + ( 3 ) ( α a β ) + a α a β , α μ β ρ β ( 4 ) R μ ν ρ σ u ν u σ = z u K α β + K α γ K γ + ( 3 ) ( α a β ) + a α a β , {:[_|__(alpha)^(mu)_|__(beta)^(rho)_(beta)^((4))R_(mu nu rho sigma)u^(nu)u^(sigma)=z_(u)K_(alpha beta)+K_(alpha gamma)K gamma],[+^((3))grad_((alpha)a_(beta))+a_(alpha)a_(beta)","]:}\begin{gathered} \perp_{\alpha}^{\mu} \perp_{\beta}^{\rho}{ }_{\beta}^{(4)} R_{\mu \nu \rho \sigma} u^{\nu} u^{\sigma}=\mathfrak{z}_{u} K_{\alpha \beta}+K_{\alpha \gamma} K \gamma \\ +{ }^{(3)} \nabla_{(\alpha} a_{\beta)}+a_{\alpha} a_{\beta}, \end{gathered}αμβρβ(4)Rμνρσuνuσ=zuKαβ+KαγKγ+(3)(αaβ)+aαaβ,
where ( 3 ) α a β α μ β β μ a v ( 3 ) α a β α μ β β μ a v ^((3))grad_(alpha)a_(beta)-=_|__(alpha)^(mu)_|__(beta)^(_(beta))grad_(mu)a_(v){ }^{(3)} \nabla_{\alpha} a_{\beta} \equiv \perp_{\alpha}^{\mu} \perp_{\beta}^{{ }_{\beta}} \nabla_{\mu} a_{v}(3)αaβαμββμav can be shown to be the three-dimensional covariant derivative of a β a β a_(beta)a_{\beta}aβ. In Gaussian normal coordinates, show that one obtains from this result
R 0 i 0 j = t K i j + K i k K j k . R 0 i 0 j = t K i j + K i k K j k . R_(0i0j)=(del)/(del t)K_(ij)+K_(ik)K_(j)^(k).R_{0 i 0 j}=\frac{\partial}{\partial t} K_{i j}+K_{i k} K_{j}^{k} .R0i0j=tKij+KikKjk.
(g) Finally, in the construction of ( 4 ) R ( 4 ) R ^((4))R{ }^{(4)} R(4)R, one needs to show that
γ μ ν [ ( 3 ) ( μ a v ) + a μ a v ] = g μ ν η [ ( 3 ) ( μ a v ) + a μ a v ] = a ; λ λ . γ μ ν ( 3 ) ( μ a v ) + a μ a v = g μ ν η ( 3 ) ( μ a v ) + a μ a v = a ; λ λ . gamma^(mu nu)[(3)grad_((mu)a_(v))+a_(mu)a_(v)]=g^(mu nu eta)[^((3))grad_((mu)a_(v))+a_(mu)a_(v)]=a_(;lambda)^(lambda).\gamma^{\mu \nu}\left[(3) \nabla_{(\mu} a_{v)}+a_{\mu} a_{v}\right]=g^{\mu \nu \eta}\left[{ }^{(3)} \nabla_{(\mu} a_{v)}+a_{\mu} a_{v}\right]=a_{; \lambda}^{\lambda} .γμν[(3)(μav)+aμav]=gμνη[(3)(μav)+aμav]=a;λλ.
Exercise 21.10. EXPRESSION OF ( 4 ) R i nin ( 4 ) R i nin  ^((4))R^(i)_("nin "){ }^{(4)} R^{i}{ }_{\text {nin }}(4)Rinin  IN TERMS OF EXTRINSIC
CURVATURE, PLUS A COVARIANT DIVERGENCE
(exercise provided by K. Kuchaŕ)
Let α α alpha^(')\alpha^{\prime}α be an arbitrary smooth set of four coordinates, not necessarily coordinated in any way with the choice of the 1-parameter family of hypersurfaces.
(a) Show that
( 4 ) R n i n i = g α γ n β ( n α ; β γ n α ; γ β ) ( 4 ) R n i n i = g α γ n β n α ; β γ n α ; γ β ^((4))R_(nin)^(i)=g^(alpha^(')gamma^('))n^(beta^('))(n_(alpha^(');beta^(')gamma^('))-n_(alpha^(');gamma^(')beta^('))){ }^{(4)} R_{n i n}^{i}=g^{\alpha^{\prime} \gamma^{\prime}} n^{\beta^{\prime}}\left(n_{\alpha^{\prime} ; \beta^{\prime} \gamma^{\prime}}-n_{\alpha^{\prime} ; \gamma^{\prime} \beta^{\prime}}\right)(4)Rnini=gαγnβ(nα;βγnα;γβ)
(b) Show that the covariant divergences
( n β n γ ; β ) ; γ n β n γ ; β ; γ (n^(beta^('))n^(gamma^('))_(;beta^(')))_(;gamma^('))\left(n^{\beta^{\prime}} n^{\gamma^{\prime}}{ }_{; \beta^{\prime}}\right)_{; \gamma^{\prime}}(nβnγ;β);γ
and
( n β n γ ; γ ; β n β n γ ; γ ; β -(n^(beta^('))n^(gamma^('));gamma_(;beta^(')):}-\left(n^{\beta^{\prime}} n^{\gamma^{\prime}} ; \gamma_{; \beta^{\prime}}\right.(nβnγ;γ;β
can be removed from this expression in such a way that what is left behind contains only first derivatives of the unit normal vector n n n\boldsymbol{n}n.
(c) Noting that the basis vectors e i e i e_(i)\boldsymbol{e}_{i}ei and n n n\boldsymbol{n}n form a complete set, justify the formula
g β μ = e i β ω i μ + ( n n ) 1 n β n μ , g β μ = e i β ω i μ + ( n n ) 1 n β n μ , g^(beta^(')mu^('))=e_(i)beta^(')omega^(imu^('))+(n*n)^(-1)n^(beta^('))n^(mu^(')),g^{\beta^{\prime} \mu^{\prime}}=e_{i} \beta^{\prime} \omega^{i \mu^{\prime}}+(\boldsymbol{n} \cdot \boldsymbol{n})^{-1} n^{\beta^{\prime}} n^{\mu^{\prime}},gβμ=eiβωiμ+(nn)1nβnμ,
where ω i ω i omega^(i)\omega^{i}ωi is the 1 -form dual to e i e i e_(i)\boldsymbol{e}_{i}ei.
(d) Noting that n α ; β n α = 0 n α ; β n α = 0 n_(alpha^(');beta^('))n^(alpha^('))=0n_{\alpha^{\prime} ; \beta^{\prime}} n^{\alpha^{\prime}}=0nα;βnα=0 and
K i j = e i α n α ; β e j β K i j = e i α n α ; β e j β K_(ij)=-e_(i alpha)n^(alpha^('))_(;beta^('))e_(j)^(beta^('))K_{i j}=-e_{i \alpha} n^{\alpha^{\prime}}{ }_{; \beta^{\prime}} e_{j}^{\beta^{\prime}}Kij=eiαnα;βejβ
show that
( 4 ) R n i n i = ( Tr K ) 2 Tr K 2 plus a covariant divergence. ( 4 ) R n i n i = ( Tr K ) 2 Tr K 2  plus a covariant divergence.  ^((4))R_(nin)^(i)=(Tr K)^(2)-TrK^(2)" plus a covariant divergence. "{ }^{(4)} R_{n i n}^{i}=(\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr} \boldsymbol{K}^{2} \text { plus a covariant divergence. }(4)Rnini=(TrK)2TrK2 plus a covariant divergence. 

§21.6. THE HILBERT ACTION PRINCIPLE AND THE ARNOWITT-DESER-MISNER MODIFICATION THEREOF IN THE SPACE-PLUS-TIME SPLIT

For analyzing the dynamics, it happily proves unnecessary to possess the missing formula for ( 4 ) R n i n k ( 4 ) R n i n k ^((4))R^(n)_(ink){ }^{(4)} R^{n}{ }_{i n k}(4)Rnink. It is essential, however, to have the Lagrangian density,
(21.83) 16 π L geom = ( ( 4 ) g ) 1 / 2 ( 4 ) R (21.83) 16 π L geom  = ( 4 ) g 1 / 2 ( 4 ) R {:(21.83)16 piL_("geom ")=(-^((4))g)^(1//2(4))R:}\begin{equation*} 16 \pi \mathcal{L}_{\text {geom }}=\left(-^{(4)} g\right)^{1 / 2(4)} R \tag{21.83} \end{equation*}(21.83)16πLgeom =((4)g)1/2(4)R
in the Hilbert action principle as the heart of all the dynamic analysis. In the present ADM (1962) notation, this density has the form
( ( 4 ) g ) 1 / 2 ( 4 ) R = ( ( 4 ) g ) 1 / 2 [ ( 4 ) R i j i j + 2 ( 4 ) R i n i n ] (21.84) = ( ( 4 ) g ) 1 / 2 [ R + ( n n ) ( Tr K 2 ( Tr K ) 2 ) + 2 ( n n ) ( 4 ) R n i n i ] ( 4 ) g 1 / 2 ( 4 ) R = ( 4 ) g 1 / 2 ( 4 ) R i j i j + 2 ( 4 ) R i n i n (21.84) = ( 4 ) g 1 / 2 R + ( n n ) Tr K 2 ( Tr K ) 2 + 2 ( n n ) ( 4 ) R n i n i {:[(-^((4))g)^(1//2(4))R=(-^((4))g)^(1//2)[^((4))R^(ij)_(ij)+2^((4))R^(in)_(in)]],[(21.84)=(-^((4))g)^(1//2)[R+(n*n)(TrK^(2)-(Tr K)^(2))+2(n*n)^((4))R_(nin)^(i)]]:}\begin{align*} \left(-{ }^{(4)} g\right)^{1 / 2(4)} R & =\left(-{ }^{(4)} g\right)^{1 / 2}\left[{ }^{(4)} R^{i j}{ }_{i j}+2^{(4)} R^{i n}{ }_{i n}\right] \\ & =\left(-{ }^{(4)} g\right)^{1 / 2}\left[R+(\boldsymbol{n} \cdot \boldsymbol{n})\left(\operatorname{Tr} \boldsymbol{K}^{2}-(\operatorname{Tr} \boldsymbol{K})^{2}\right)+2(\boldsymbol{n} \cdot \boldsymbol{n})^{(4)} R_{n i n}^{i}\right] \tag{21.84} \end{align*}((4)g)1/2(4)R=((4)g)1/2[(4)Rijij+2(4)Rinin](21.84)=((4)g)1/2[R+(nn)(TrK2(TrK)2)+2(nn)(4)Rnini]
Kuchař (1971b; see also exercise 21.10) shows how to calculate a sufficient part of this quantity without calculating all of it. The difference between the "sufficient part" and the "whole" is a time derivative plus a divergence, a quantity of the form
(21.85) [ ( ( 4 ) g ) 1 / 2 A α ] , α = ( ( 4 ) g ) 1 / 2 A ; α α (21.85) ( 4 ) g 1 / 2 A α , α = ( 4 ) g 1 / 2 A ; α α {:(21.85)[(-^((4))g)^(1//2)A^(alpha)]_(,alpha)=(-^((4))g)^(1//2)A_(;alpha^(**))^(alpha):}\begin{equation*} \left[\left(-{ }^{(4)} g\right)^{1 / 2} A^{\alpha}\right]_{, \alpha}=\left(-{ }^{(4)} g\right)^{1 / 2} A_{; \alpha^{*}}^{\alpha} \tag{21.85} \end{equation*}(21.85)[((4)g)1/2Aα],α=((4)g)1/2A;αα
Drop a complete derivative from the Hilbert action principle to get the ADM principle
When one multiplies (21.83) by d t d x 1 d x 2 d x 3 d t d x 1 d x 2 d x 3 dtdx^(1)dx^(2)dx^(3)d t d x^{1} d x^{2} d x^{3}dtdx1dx2dx3 and integrates to obtain the action integral, the term (21.85) integrates out to a surface term. Variations of the geometry interior to this surface make no difference in the value of this surface term. Therefore it has no influence on the equations of motion to drop the term (21.85). The result of the calculation (exercise 21.10) is simple: what is left over after dropping the divergence merely changes the sign of the terms in Tr K 2 Tr K 2 TrK^(2)\operatorname{Tr} \boldsymbol{K}^{2}TrK2 and ( Tr K ) 2 ( Tr K ) 2 (Tr K)^(2)(\operatorname{Tr} \boldsymbol{K})^{2}(TrK)2 in (21.84). Thus the variation principle becomes
(21.86) ( extremum ) = I modified = E modified d 4 x = ( 1 / 16 π ) [ R + ( n n ) ( ( Tr K ) 2 Tr K 2 ) ] N g 1 / 2 d t d 3 x + E fields d 4 x . (21.86) (  extremum  ) = I modified  = E modified  d 4 x = ( 1 / 16 π ) R + ( n n ) ( Tr K ) 2 Tr K 2 N g 1 / 2 d t d 3 x + E fields  d 4 x . {:[(21.86)(" extremum ")=I_("modified ")=intE_("modified ")d^(4)x],[quad=(1//16 pi)int[R+(n*n)((Tr K)^(2)-TrK^(2))]Ng^(1//2)dtd^(3)x+intE_("fields ")d^(4)x.]:}\begin{align*} & (\text { extremum })=I_{\text {modified }}=\int \mathcal{E}_{\text {modified }} d^{4} x \tag{21.86}\\ & \quad=(1 / 16 \pi) \int\left[R+(\boldsymbol{n} \cdot \boldsymbol{n})\left((\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr} \boldsymbol{K}^{2}\right)\right] N g^{1 / 2} d t d^{3} x+\int \mathcal{E}_{\text {fields }} d^{4} x . \end{align*}(21.86)( extremum )=Imodified =Emodified d4x=(1/16π)[R+(nn)((TrK)2TrK2)]Ng1/2dtd3x+Efields d4x.
This expression, rephrased, is the starting point for Arnowitt, Deser, and Misner's analysis of the dynamics of geometry.
Two supplements from a paper of York (1972b; see also exercise 21.9) enlarge one's geometric insight into what is going on in the foregoing analysis. First, the tensor of extrinsic curvature lets itself be defined [see also Fischer (1971)] most naturally in the form
(21.87) K = 1 2 z n g (21.87) K = 1 2 z n g {:(21.87)K=-(1)/(2)z_(n)g:}\begin{equation*} \boldsymbol{K}=-\frac{1}{2} \mathscr{z}_{n} \boldsymbol{g} \tag{21.87} \end{equation*}(21.87)K=12zng
where g g g\boldsymbol{g}g is the metric tensor of the 3-geometry, n n n\boldsymbol{n}n is the timelike unit normal field, and L L L\mathscr{\mathscr { L }}L is the Lie derivative as defined in exercise 21.8. Second, the divergence (21.85), which has to be added to the Lagrangian of ( 21.86 ) ( 21.86 ) (21.86)(21.86)(21.86) to obtain the full Hilbert Lagrangian, is
(21.88) 2 [ ( ( 4 ) g ) 1 / 2 ( n α Tr K + a α ) ] , α , (21.88) 2 ( 4 ) g 1 / 2 n α Tr K + a α , α , {:(21.88)-2[(-^((4))g)^(1//2)(n^(alpha^('))Tr K+a^(alpha^(')))]","alpha^(')",":}\begin{equation*} -2\left[\left(-{ }^{(4)} g\right)^{1 / 2}\left(n^{\alpha^{\prime}} \operatorname{Tr} \boldsymbol{K}+a^{\alpha^{\prime}}\right)\right], \alpha^{\prime}, \tag{21.88} \end{equation*}(21.88)2[((4)g)1/2(nαTrK+aα)],α,
where the coordinates are general (see exercise 21.10), and
(21.89) a α = n α ; β n β (21.89) a α = n α ; β n β {:(21.89)a^(alpha^('))=n^(alpha^('))_(;beta^('))n^(beta^(')):}\begin{equation*} a^{\alpha^{\prime}}=n^{\alpha^{\prime}}{ }_{; \beta^{\prime}} n^{\beta^{\prime}} \tag{21.89} \end{equation*}(21.89)aα=nα;βnβ
is the 4-acceleration of an observer traveling along the timelike normal n n n\boldsymbol{n}n to the successive slices.

§21.7. THE ARNOWITT, DESER, AND MISNER FORMULATION OF THE DYNAMICS OF GEOMETRY

Dirac (1959, 1964, and earlier references cited therein) formulated the dynamics of geometry in a ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1)-dimensional form, using generalizations of Poisson brackets and of Hamilton equations. Arnowitt, Deser, and Misner instead made the HilbertPalatini variational principle the foundation for this dynamics. Because of its simplicity, this ADM (1962) approach is followed here. The gravitational part of the integrand in the Hilbert-Palatini action principle is rewritten in the condensed but standard form (after inserting a 16 π 16 π 16 pi16 \pi16π that ADM avoid by other units) as
16 π L geom true = L geom ADM = g i j π i j t N K N i K i (21.90) 2 [ π i j N j 1 2 N i Tr π + N i ( g ) 1 / 2 ] , i . 16 π L geom true  = L geom ADM  = g i j π i j t N K N i K i (21.90) 2 π i j N j 1 2 N i Tr π + N i ( g ) 1 / 2 , i . {:[16 piL_("geom true ")=L_("geom ADM ")=-g_(ij)delpi^(ij)del t-NK-N_(i)K^(i)],[(21.90)-2[pi^(ij)N_(j)-(1)/(2)N^(i)Tr pi+N^(∣i)(g)^(1//2)]_(,i).]:}\begin{align*} 16 \pi \mathcal{L}_{\text {geom true }}= & \mathcal{L}_{\text {geom ADM }}=-g_{i j} \partial \pi^{i j} \partial t-N \mathscr{K}-N_{i} \mathscr{K}^{i} \\ & -2\left[\pi^{i j} N_{j}-\frac{1}{2} N^{i} \operatorname{Tr} \pi+N^{\mid i}(g)^{1 / 2}\right]_{, i} . \tag{21.90} \end{align*}16πLgeom true =Lgeom ADM =gijπijtNKNiKi(21.90)2[πijNj12NiTrπ+Ni(g)1/2],i.
Here each item of abbreviation has its special meaning and will play its special part, a part foreshadowed by the name now given it:
π true i j = δ ( action ) δ g i j = ( "geometrodynamic field momentum" dyn- amically conjugate to the "geometrodynamic field coordinate" g i j ) = π i j 16 π ; π i j = g 1 / 2 ( g i j Tr K K i j ) π true  i j = δ (  action  ) δ g i j =  "geometrodynamic   field momentum" dyn-   amically conjugate to   the "geometrodynamic   field coordinate"  g i j = π i j 16 π ; π i j = g 1 / 2 g i j Tr K K i j pi_("true ")^(ij)=(delta(" action "))/(deltag_(ij))=([" "geometrodynamic "],[" field momentum" dyn- "],[" amically conjugate to "],[" the "geometrodynamic "],[" field coordinate" "g_(ij)])=(pi^(ij))/(16 pi);pi^(ij)=g^(1//2)(g^(ij)Tr K-K^(ij))\pi_{\text {true }}^{i j}=\frac{\delta(\text { action })}{\delta g_{i j}}=\left(\begin{array}{l} \text { "geometrodynamic } \\ \text { field momentum" dyn- } \\ \text { amically conjugate to } \\ \text { the "geometrodynamic } \\ \text { field coordinate" } g_{i j} \end{array}\right)=\frac{\pi^{i j}}{16 \pi} ; \pi^{i j}=g^{1 / 2}\left(g^{i j} \operatorname{Tr} \boldsymbol{K}-K^{i j}\right)πtrue ij=δ( action )δgij=( "geometrodynamic  field momentum" dyn-  amically conjugate to  the "geometrodynamic  field coordinate" gij)=πij16π;πij=g1/2(gijTrKKij)
Momenta conjugate to the dynamic g i j g i j g_(ij)g_{i j}gij
(here the π i j π i j pi^(ij)\pi^{i j}πij of ADM is usually more convenient than π true i j π true  i j pi_("true ")^(ij)\pi_{\text {true }}^{i j}πtrue ij ); and
(21.92) K true = H ( π true i j , g i j ) = ( "super-Hamiltonian" ) = H / 16 π ; K ( π i j , g i j ) = g 1 / 2 ( Tr n 2 1 2 ( Tr π ) 2 ) g 1 / 2 R ; (21.92) K true  = H π true  i j , g i j = (  "super-Hamiltonian"  ) = H / 16 π ; K π i j , g i j = g 1 / 2 Tr n 2 1 2 ( Tr π ) 2 g 1 / 2 R ; {:[(21.92)K_("true ")=H(pi_("true ")^(ij),g_(ij))=(" "super-Hamiltonian" ")=H//16 pi;],[K(pi^(ij),g_(ij))=g^(-1//2)(Trn^(2)-(1)/(2)(Tr pi)^(2))-g^(1//2)R;]:}\begin{align*} \mathscr{K}_{\text {true }} & =\mathscr{H}\left(\pi_{\text {true }}^{i j}, g_{i j}\right)=(\text { "super-Hamiltonian" })=\mathscr{H} / 16 \pi ; \tag{21.92}\\ \mathscr{K}\left(\pi^{i j}, g_{i j}\right) & =g^{-1 / 2}\left(\operatorname{Tr} \boldsymbol{n}^{2}-\frac{1}{2}(\operatorname{Tr} \boldsymbol{\pi})^{2}\right)-g^{1 / 2} R ; \end{align*}(21.92)Ktrue =H(πtrue ij,gij)=( "super-Hamiltonian" )=H/16π;K(πij,gij)=g1/2(Trn212(Trπ)2)g1/2R;
and
(21.93) 16 π H true i = K i = K i ( π i j , g i j ) = ( "supermomentum" ) = 2 π i k k . (21.93) 16 π H true  i = K i = K i π i j , g i j = (  "supermomentum"  ) = 2 π i k k . {:(21.93)16 piH_("true ")^(i)=K^(i)=K^(i)(pi^(ij),g_(ij))=(" "supermomentum" ")=-2pi^(ik)_(∣k).:}\begin{equation*} 16 \pi \mathscr{H}_{\text {true }}^{i}=\mathscr{K}^{i}=\mathscr{K}^{i}\left(\pi^{i j}, g_{i j}\right)=(\text { "supermomentum" })=-2 \pi^{i k}{ }_{\mid k} . \tag{21.93} \end{equation*}(21.93)16πHtrue i=Ki=Ki(πij,gij)=( "supermomentum" )=2πikk.
Here the covariant derivative is formed treating π i k π i k pi^(ik)\pi^{i k}πik as a tensor density, as its definition in (21.91) shows it to be (see §21.2). The quantities to be varied to extremize the action are the coefficients in the metric of the 4-geometry, as follows: the six g i j g i j g_(ij)g_{i j}gij and the lapse function N N NNN and shift function N i N i N_(i)N_{i}Ni; and also the six "geometrodynamic momenta," π i j π i j pi^(ij)\pi^{i j}πij. To vary these momenta as well as the metric is (1) to follow the pattern of elementary Hamiltonian dynamics (Box 21.1), where, by taking the momentum p p ppp to be as independently variable as the coordinate x x xxx, one arrives at two Hamilton equations of the first order instead of one Lagrange equation of the second order, and (2) to follow in some measure the lead of the Palatini variation principle of $ 21.2 $ 21.2 $21.2\$ 21.2$21.2. There, however, one had 40 connection coefficients to vary, whereas here one has come down to only six π i j π i j pi^(ij)\pi^{i j}πij. To know these momenta and the 3-metric is to know the extrinsic curvature. Before carrying out the variation, drop the divergence 2 [ ] , i 2 [ ] , i -2[quad]_(,i)-2[\quad]_{, i}2[],i from (21.90), since it gives rise only to surface integrals and therefore in no way affects the equations of motion that will come out of the variational principle. Also rewrite the first term in (21.90) in the form
(21.94) ( / t ) ( g i j π i j ) + π i j g i j / t , (21.94) ( / t ) g i j π i j + π i j g i j / t , {:(21.94)-(del//del t)(g_(ij)pi^(ij))+pi^(ij)delg_(ij)//del t",":}\begin{equation*} -(\partial / \partial t)\left(g_{i j} \pi^{i j}\right)+\pi^{i j} \partial g_{i j} / \partial t, \tag{21.94} \end{equation*}(21.94)(/t)(gijπij)+πijgij/t,
and drop the complete time-derivative from the variation principle, again because it is irrelevant to the resulting equations of motion. The action principle now takes the form
extremum = I true = I ADM / 16 π = ( 1 / 16 π ) [ π i j g i j / t N K ( π i j , g i j ) N i K i ( π i j , g i j ) ] d 4 x (21.95) + E field d 4 x .  extremum  = I true  = I ADM / 16 π = ( 1 / 16 π ) π i j g i j / t N K π i j , g i j N i K i π i j , g i j d 4 x (21.95) + E field  d 4 x . {:[" extremum "=I_("true ")=I_(ADM)//16 pi],[=(1//16 pi)int[pi^(ij)delg_(ij)//del t-NK(pi^(ij),g_(ij))-N_(i)K^(i)(pi^(ij),g_(ij))]d^(4)x],[(21.95)+intE_("field ")d^(4)x.]:}\begin{align*} \text { extremum }= & I_{\text {true }}=I_{\mathrm{ADM}} / 16 \pi \\ = & (1 / 16 \pi) \int\left[\pi^{i j} \partial g_{i j} / \partial t-N \mathscr{K}\left(\pi^{i j}, g_{i j}\right)-N_{i} \mathscr{K}^{i}\left(\pi^{i j}, g_{i j}\right)\right] d^{4} x \\ & +\int \mathcal{E}_{\text {field }} d^{4} x . \tag{21.95} \end{align*} extremum =Itrue =IADM/16π=(1/16π)[πijgij/tNK(πij,gij)NiKi(πij,gij)]d4x(21.95)+Efield d4x.
The action principle itself, here as always, tells one what must be fixed to make the action take on a well-defined value (if and when the action possesses an extremum). Apart from appropriate potentials having to do with fields other than geom-
Action principle says, fix 3-geometry on each face of sandwich
What a 3-geometry is
Electromagnetism gives example of momentum conjugate to "field coordinate"
etry, the only quantities that have to be fixed appear at first sight to be the values of the six g i j g i j g_(ij)g_{i j}gij on the initial and final spacelike hypersurfaces. However, the ADM action principle is invariant with respect to any change of coordinates x 1 , x 2 , x 3 x 1 , x 2 , x 3 x^(1),x^(2),x^(3)x^{1}, x^{2}, x^{3}x1,x2,x3 x 1 , x 2 , x 3 x 1 ¯ , x 2 ¯ , x 3 ¯ longrightarrowx^( bar(1)),x^( bar(2)),x^( bar(3))\longrightarrow x^{\overline{1}}, x^{\overline{2}}, x^{\overline{3}}x1,x2,x3 within the successive spacelike slices. Therefore the quantities that really have to be fixed on the two faces of the sandwich are the 3 -geometries ( 3 ) g ( 3 ) g ^((3))g^('){ }^{(3)} \mathrm{g}^{\prime}(3)g (on the initial hypersurface) and ( 3 ) ( 3 ) ^((3))ℓ{ }^{(3)} \ell(3) (on the final hypersurface) and nothing more.
In mathematical terms, a 3-geometry ( 3 ) ( 3 ) ^((3))ℓ{ }^{(3)} \ell(3) is the "equivalence class" of a set of differentiable manifolds that are isometrically equivalent to each other under diffeomorphisms. In the terms of the everyday physicist, a 3-geometry is the equivalence class of 3 -metrics g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)(x,y,z)g_{i j}(x, y, z)gij(x,y,z) that are equivalent to one another under coordinate transformations. In more homely terms, two automobile fenders have one and the same 2-geometry if they have the same shape, regardless of how much the coordinate rulings painted on the one may differ from the coordinate rulings painted on the other.
To have in equation (21.95) an example of a field Lagrangian that is at the same time physically relevant and free of avoidable complications, take the case of a source-free electromagnetic field. It would be possible to take the field Lagrangian to have the standard Maxwell value,
(21.96) ( 1 / 8 π ) ( E 2 B 2 ) ( 1 / 16 π ) F μ ν F μ ν (21.96) ( 1 / 8 π ) E 2 B 2 ( 1 / 16 π ) F μ ν F μ ν {:(21.96)(1//8pi)(E^(2)-B^(2))longrightarrow-(1//16 pi)F_(mu nu)F^(mu nu):}\begin{equation*} (1 / 8 \pi)\left(\boldsymbol{E}^{2}-\boldsymbol{B}^{2}\right) \longrightarrow-(1 / 16 \pi) F_{\mu \nu} F^{\mu \nu} \tag{21.96} \end{equation*}(21.96)(1/8π)(E2B2)(1/16π)FμνFμν
with
(21.97) F μ ν = A ν / x μ A μ / x ν (21.97) F μ ν = A ν / x μ A μ / x ν {:(21.97)F_(mu nu)=delA_(nu)//delx^(mu)-delA_(mu)//delx^(nu):}\begin{equation*} F_{\mu \nu}=\partial A_{\nu} / \partial x^{\mu}-\partial A_{\mu} / \partial x^{\nu} \tag{21.97} \end{equation*}(21.97)Fμν=Aν/xμAμ/xν
The variation of the Lagrangian with respect to the independent dynamic variables of the field, the four potentials A α A α A_(alpha)A_{\alpha}Aα, would then immediately give the four second-order partial differential wave equations for these four potentials. However, to have instead a larger number of first-order equations is as convenient for electrodynamics as it is for geometrodynamics. One seeks for the analog of the Hamiltonian equations of particle dynamics,
(21.98) d x / d t = H ( x , p ) / p d p / d t = H ( x , p ) / x (21.98) d x / d t = H ( x , p ) / p d p / d t = H ( x , p ) / x {:[(21.98)dx//dt=del H(x","p)//del p],[dp//dt=-del H(x","p)//del x]:}\begin{align*} d x / d t & =\partial H(x, p) / \partial p \tag{21.98}\\ d p / d t & =-\partial H(x, p) / \partial x \end{align*}(21.98)dx/dt=H(x,p)/pdp/dt=H(x,p)/x
One gets those equations by replacing the Lagrange integral L ( x , x ˙ ) d t L ( x , x ˙ ) d t int L(x,x^(˙))dt\int L(x, \dot{x}) d tL(x,x˙)dt by the Hamilton integral [ p x ˙ H ( x , p ) ] d t [ p x ˙ H ( x , p ) ] d t int[px^(˙)-H(x,p)]dt\int[p \dot{x}-H(x, p)] d t[px˙H(x,p)]dt. Likewise, here one replaces the action integrand of (21.96) by what in flat spacetime would be
(21.99) ( 1 / 4 π ) [ A μ , ν F μ ν + 1 4 F μ ν F μ ν ] (21.99) ( 1 / 4 π ) A μ , ν F μ ν + 1 4 F μ ν F μ ν {:(21.99)(1//4pi)[A_(mu,nu)F^(mu nu)+(1)/(4)F_(mu nu)F^(mu nu)]:}\begin{equation*} (1 / 4 \pi)\left[A_{\mu, \nu} F^{\mu \nu}+\frac{1}{4} F_{\mu \nu} F^{\mu \nu}\right] \tag{21.99} \end{equation*}(21.99)(1/4π)[Aμ,νFμν+14FμνFμν]
In actuality, spacetime is to be regarded as not only curved but also sliced up into spacelike hypersurfaces. This ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1) split of the geometry made it desirable to split the ten geometrodynamic potentials into the six g i j g i j g_(ij)g_{i j}gij and the four lapse and shift functions. Here one similarly splits the four A μ A μ A_(mu)A_{\mu}Aμ into the three components A i A i A_(i)A_{i}Ai of the vector potential and the scalar potential A 0 = ϕ A 0 = ϕ A_(0)=-phiA_{0}=-\phiA0=ϕ (with the sign so chosen that, in flat spacetime in a Minkowski coordinate system, ϕ = A 0 ϕ = A 0 phi=A^(0)\phi=A^{0}ϕ=A0 ). In this notation, the
Lagrange density function, including the standard density factor ( ( 4 ) g ) 1 / 2 ( 4 ) g 1 / 2 (-^((4))g)^(1//2)\left(-{ }^{(4)} g\right)^{1 / 2}((4)g)1/2 but dropping a complete time integral ( / t ) ( A i E i ) ( / t ) A i E i (del//del t)(A_(i)E^(i))(\partial / \partial t)\left(A_{i} \mathcal{E}^{i}\right)(/t)(AiEi) that has no influence on the equations of motion, is given by the formula
(21.100) 4 π E field = E i A i / t + ϕ S i , i 1 2 N g 1 / 2 g i j ( E i S j + B i B j ) + N i [ j k ] E j B k . (21.100) 4 π E field  = E i A i / t + ϕ S i , i 1 2 N g 1 / 2 g i j E i S j + B i B j + N i [ j k ] E j B k . {:[(21.100)4piE_("field ")=-E^(i)delA_(i)//del t+phiS^(i)_(,i)],[-(1)/(2)Ng^(-1//2)g_(ij)(E^(i)S^(j)+B^(i)B^(j))+N^(i)[jk]E^(j)B^(k).]:}\begin{align*} 4 \pi \mathcal{E}_{\text {field }}= & -\mathcal{E}^{i} \partial A_{i} / \partial t+\phi \mathcal{S}^{i}{ }_{, i} \tag{21.100}\\ & -\frac{1}{2} N g^{-1 / 2} g_{i j}\left(\mathcal{E}^{i} \mathcal{S}^{j}+\mathscr{B}^{i} \mathscr{B}^{j}\right)+N^{i}[j k] \mathcal{E}^{j} \mathscr{B}^{k} . \end{align*}(21.100)4πEfield =EiAi/t+ϕSi,i12Ng1/2gij(EiSj+BiBj)+Ni[jk]EjBk.
Here use is made of the alternating symbol [ i j k ] [ i j k ] [ijk][i j k][ijk], defined as changing sign on the interchange of any two labels, and normalized so that [ 123 ] = 1 [ 123 ] = 1 [123]=1[123]=1[123]=1. Note that the 3 -tensor ε i j k ε i j k epsi^(ijk)\varepsilon^{i j k}εijk and the alternating symbol [ i j k ] [ i j k ] [ijk][i j k][ijk] are related much as are the corresponding four-dimensional objects in equation (8.10), so that one can write
(21.101) B i = 1 2 [ i j k ] ( A k , j A j , k ) . (21.101) B i = 1 2 [ i j k ] A k , j A j , k . {:(21.101)B^(i)=(1)/(2)[ijk](A_(k,j)-A_(j,k)).:}\begin{equation*} \mathscr{B}^{i}=\frac{1}{2}[i j k]\left(A_{k, j}-A_{j, k}\right) . \tag{21.101} \end{equation*}(21.101)Bi=12[ijk](Ak,jAj,k).
The quantities G i G i G^(i)\mathscr{G}^{i}Gi are the components of the magnetic field in the spacelike slice. They are not regarded as independently variable. They are treated as fully fixed by the choice of the three potentials A i A i A_(i)A_{i}Ai. The converse is the case for the components S i S i S^(i)\mathcal{S}^{i}Si of the electric field: they are treated like momenta, and as independently variable.
Extremizing the action with respect to the E i E i E^(i)\mathcal{E}^{i}Ei (exercise 21.11) gives the analog of the equation d x / d t = p / m d x / d t = p / m dx//dt=p//md x / d t=p / mdx/dt=p/m in particle mechanics, and the analog of the equation
(21.102) E i = A i / t ϕ / x i (21.102) E i = A i / t ϕ / x i {:(21.102)E_(i)=-delA_(i)//del t-del phi//delx^(i):}\begin{equation*} E_{i}=-\partial A_{i} / \partial t-\partial \phi / \partial x^{i} \tag{21.102} \end{equation*}(21.102)Ei=Ai/tϕ/xi
of flat-spacetime electrodynamics; namely,
(21.103) A i / t ϕ , i N g 1 / 2 g i j E j [ i j k ] N i G k = 0 . (21.103) A i / t ϕ , i N g 1 / 2 g i j E j [ i j k ] N i G k = 0 . {:(21.103)-delA_(i)//del t-phi_(,i)-Ng^(-1//2)g_(ij)E^(j)-[ijk]N^(i)G^(k)=0.:}\begin{equation*} -\partial A_{i} / \partial t-\phi_{, i}-N g^{-1 / 2} g_{i j} \mathcal{E}^{j}-[i j k] N^{i} \mathcal{G}^{k}=0 . \tag{21.103} \end{equation*}(21.103)Ai/tϕ,iNg1/2gijEj[ijk]NiGk=0.
Here the last term containing the shift functions N j N j N^(j)N^{j}Nj, arises from the obliquity of the coordinate system. ADM give the following additional but equivalent ways to state the result (21.103):
S i = 1 2 [ i j k ] F j k (21.104) = 1 2 [ i j k ] { 1 2 [ j k μ ν ] ( ( 4 ) g ) 1 / 2 ( 4 ) g μ ( 4 ) g ν β F α β } . S i = 1 2 [ i j k ] F j k (21.104) = 1 2 [ i j k ] 1 2 [ j k μ ν ] ( 4 ) g 1 / 2 ( 4 ) g μ ( 4 ) g ν β F α β . {:[S^(i)=(1)/(2)[ijk]^(**)F_(jk)],[(21.104)=(1)/(2)[ijk]{(1)/(2)[jk mu nu](-^((4))g)^(1//2(4))g^(mu(4))g^(nu beta)F_(alpha beta)}.]:}\begin{align*} \mathcal{S}^{i} & =\frac{1}{2}[i j k]^{*} F_{j k} \\ & =\frac{1}{2}[i j k]\left\{\frac{1}{2}[j k \mu \nu]\left(-{ }^{(4)} g\right)^{1 / 2(4)} g^{\mu(4)} g^{\nu \beta} F_{\alpha \beta}\right\} . \tag{21.104} \end{align*}Si=12[ijk]Fjk(21.104)=12[ijk]{12[jkμν]((4)g)1/2(4)gμ(4)gνβFαβ}.
They note that E j E j E^(j)\mathcal{E}^{j}Ej and G j G j G^(j)\mathscr{G}^{j}Gj are not directly the contravariant components of the fields in the simultaneity Σ Σ Sigma\SigmaΣ,
(21.105) E = E j e j , B = B j e j , (21.105) E = E j e j , B = B j e j , {:(21.105)E=E^(j)e_(j)","B=B^(j)e_(j)",":}\begin{equation*} \boldsymbol{E}=E^{j} \boldsymbol{e}_{j}, \boldsymbol{B}=B^{j} \boldsymbol{e}_{j}, \tag{21.105} \end{equation*}(21.105)E=Ejej,B=Bjej,
but the contravariant densities,
(21.106) S j = g 1 / 2 E j , B j = g 1 / 2 B j (21.106) S j = g 1 / 2 E j , B j = g 1 / 2 B j {:(21.106)S^(j)=g^(1//2)E^(j)","B^(j)=g^(1//2)B^(j):}\begin{equation*} \mathcal{S}^{j}=g^{1 / 2} E^{j}, \mathscr{B}^{j}=g^{1 / 2} B^{j} \tag{21.106} \end{equation*}(21.106)Sj=g1/2Ej,Bj=g1/2Bj
Extremizing the action with respect to the three A i A i A_(i)A_{i}Ai (exercise 21.12) gives the curved-spacetime analog of the Maxwell equations,
(21.107) E / t = × B . (21.107) E / t = × B . {:(21.107)del E//del t=grad xx B.:}\begin{equation*} \partial \boldsymbol{E} / \partial t=\boldsymbol{\nabla} \times \boldsymbol{B} . \tag{21.107} \end{equation*}(21.107)E/t=×B.
Lagrange density for electromagnetism electromagnetism
Divergence relation by extremization with respect to ϕ ϕ phi\phiϕ
Action principle tells what to fix at limits
At limits, fix not potentials but magnetic field itself
The remaining potential, ϕ ϕ phi\phiϕ, enters the action principle at only one point. Extremizing with respect to it gives immediately the divergence relation of source-free electromagnetism,
(21.108) E , i i = 0 (21.108) E , i i = 0 {:(21.108)E_(,i)^(i)=0:}\begin{equation*} \mathcal{E}_{, i}^{i}=0 \tag{21.108} \end{equation*}(21.108)E,ii=0
If an action principle tells in and by itself what quantities are to be fixed at the limits, what lessons does (21.100) give on this score? One can go back to the example of particle mechanics in Hamiltonian form, as in Box 21.1, and note that there the momentum p p ppp could "flap in the breeze." Only the coordinate x x xxx had to be fixed at the limits. Thus the variation of the action was
(21.109) δ I = δ [ p x ˙ H ( x , p ) ] d t = { [ x ˙ H / p ] δ p + ( d / d t ) ( p δ x ) + [ p ˙ H / x ] δ x } d t (21.109) δ I = δ [ p x ˙ H ( x , p ) ] d t = { [ x ˙ H / p ] δ p + ( d / d t ) ( p δ x ) + [ p ˙ H / x ] δ x } d t {:[(21.109)delta I=delta int[px^(˙)-H(x","p)]dt],[=int{[x^(˙)-del H//del p]delta p+(d//dt)(p delta x)+[-p^(˙)-del H//del x]delta x}dt]:}\begin{align*} \delta I & =\delta \int[p \dot{x}-H(x, p)] d t \tag{21.109}\\ & =\int\{[\dot{x}-\partial H / \partial p] \delta p+(d / d t)(p \delta x)+[-\dot{p}-\partial H / \partial x] \delta x\} d t \end{align*}(21.109)δI=δ[px˙H(x,p)]dt={[x˙H/p]δp+(d/dt)(pδx)+[p˙H/x]δx}dt
To arrive at a well-defined extremum of the action integral I I III, it was not enough to annul the coefficients, in square brackets, of δ p δ p delta p\delta pδp and δ x δ x delta x\delta xδx; that is, to impose Hamilton's equations of motion. It was necessary in addition to annul the quantities at limits, p δ x p δ x p delta xp \delta xpδx; that is, to specify x x xxx at the start and at the end of the motion. Similarly here. The quantities ϕ ϕ phi\phiϕ and E i E i E^(i)\mathscr{E}^{i}Ei flap in the breeze, but the magnetic field has to be specified on the two faces of the sandwich to allow one to speak of a well-defined extremum of the action principle. Why the magnetic field, or the three quantities
(21.110) A j / x i A i / x j (21.110) A j / x i A i / x j {:(21.110)delA_(j)//delx^(i)-delA_(i)//delx^(j):}\begin{equation*} \partial A_{j} / \partial x^{i}-\partial A_{i} / \partial x^{j} \tag{21.110} \end{equation*}(21.110)Aj/xiAi/xj
why not the three A i A i A_(i)A_{i}Ai themselves? When one varies (21.100) with respect to the A i A i A_(i)A_{i}Ai, and integrates the variation of the first term by parts, as one must to arrive at the dynamic equations, one obtains a term at limits
(21.111) Σ intual E i δ A i d 3 x Σ final E i δ A i d 3 x (21.111) Σ intual  E i δ A i d 3 x Σ final  E i δ A i d 3 x {:(21.111)int_(Sigma_("intual "))E^(i)deltaA_(i)d^(3)x-int_(Sigma_("final "))E^(i)deltaA_(i)d^(3)x:}\begin{equation*} \int_{\Sigma_{\text {intual }}} \mathcal{E}^{i} \delta A_{i} d^{3} x-\int_{\Sigma_{\text {final }}} \mathcal{E}^{i} \delta A_{i} d^{3} x \tag{21.111} \end{equation*}(21.111)Σintual EiδAid3xΣfinal EiδAid3x
One demands that both these terms at limits must vanish in order to have a welldefined variational problem. Go from the given vector potential to another vector potential, A i new A i new  A_(i_("new "))A_{i_{\text {new }}}Ainew , by the gauge transformation
(21.112) A i new = A i + δ A i = A i + λ / x i (21.112) A i new  = A i + δ A i = A i + λ / x i {:(21.112)A_(i_("new "))=A_(i)+deltaA_(i)=A_(i)+del lambda//delx^(i):}\begin{equation*} A_{i_{\text {new }}}=A_{i}+\delta A_{i}=A_{i}+\partial \lambda / \partial x^{i} \tag{21.112} \end{equation*}(21.112)Ainew =Ai+δAi=Ai+λ/xi
The magnetic-field components given by the three A i new A i new  A_(i_("new "))A_{i_{\text {new }}}Ainew  differ in no way from those listed in (21.110). Moreover the "variation at limits,"
(21.113) E i δ A i d 3 x = E i λ / x i d 3 x = λ E , i i d 3 x (21.113) E i δ A i d 3 x = E i λ / x i d 3 x = λ E , i i d 3 x {:(21.113)intE^(i)deltaA_(i)d^(3)x=intE^(i)del lambda//delx^(i)d^(3)x=-int lambdaE_(,i)^(i)d^(3)x:}\begin{equation*} \int \mathcal{E}^{i} \delta A_{i} d^{3} x=\int \mathcal{E}^{i} \partial \lambda / \partial x^{i} d^{3} x=-\int \lambda \mathcal{E}_{, i}^{i} d^{3} x \tag{21.113} \end{equation*}(21.113)EiδAid3x=Eiλ/xid3x=λE,iid3x
is automatically zero by virtue of the divergence condition (21.108), for any arbitrary choice of λ λ lambda\lambdaλ. Therefore the quantities fixed at limits are not the three A i A i A_(i)A_{i}Ai themselves (mere potentials) but the physically significant quantities (21.110), the components of the magnetic field. Moreover, the divergence condition E i , i = 0 E i , i = 0 E^(i)_(,i)=0\mathcal{E}^{i}{ }_{, i}=0Ei,i=0 now becomes the initial-value equation for the determination of the potential ϕ ϕ phi\phiϕ.
In the preceding paragraph one need only replace "the three A i A i A_(i)A_{i}Ai " by "the six g i j g i j g_(ij)g_{i j}gij " and "the components of the magnetic field" by "the 3-geometry ( 3 ) g ( 3 ) g ^((3))g{ }^{(3)} g(3)g " and "the potential ϕ ϕ phi\phiϕ " by "the lapse and shift functions N N NNN and N i " N i " N^(i")N^{i "}Ni" to pass from electrodynamics to geometrodynamics.
With this parallelism in view, turn back to the variational principle (21.95) of general relativity in the ADM formulation. With the 3-geometry fixed on the two faces of the sandwich, vary conditions in between to extremize the action, varying in turn the π i j π i j pi^(ij)\pi^{i j}πij, the g i j g i j g_(ij)g_{i j}gij, and the lapse and shift functions. The geometrodynamic momenta appear everywhere only algebraically in the action principle, except in the term 2 N i π i j j 2 N i π i j j -2N_(i)pi^(ij)_(∣j)-2 N_{i} \pi^{i j}{ }_{\mid j}2Niπijj. Variation and integration by parts gives 2 N i j δ π i j 2 N i j δ π i j 2N_(i∣j)deltapi^(ij)2 N_{i \mid j} \delta \pi^{i j}2Nijδπij. Collecting coefficients of δ π i j δ π i j deltapi^(ij)\delta \pi^{i j}δπij and annuling the sum of these coefficients, one arrives at one of the several conditions required for an extremum,
(21.114) g i j / t = 2 N g 1 / 2 ( π i j 1 2 g i j Tr π ) + N i j + N j i (21.114) g i j / t = 2 N g 1 / 2 π i j 1 2 g i j Tr π + N i j + N j i {:(21.114)delg_(ij)//del t=2Ng^(-1//2)(pi_(ij)-(1)/(2)g_(ij)Tr pi)+N_(i∣j)+N_(j∣i):}\begin{equation*} \partial g_{i j} / \partial t=2 N g^{-1 / 2}\left(\pi_{i j}-\frac{1}{2} g_{i j} \operatorname{Tr} \pi\right)+N_{i \mid j}+N_{j \mid i} \tag{21.114} \end{equation*}(21.114)gij/t=2Ng1/2(πij12gijTrπ)+Nij+Nji
This result agrees with what one gets from equations (21.91) defining geometrodynamic momentum in terms of extrinsic curvature, together with expression (21.67) for extrinsic curvature in terms of lapse and shift. The result (21.114) here is no less useful than the result
d x / d t = H ( x , p ) / p = p / m d x / d t = H ( x , p ) / p = p / m dx//dt=del H(x,p)//del p=p//md x / d t=\partial H(x, p) / \partial p=p / mdx/dt=H(x,p)/p=p/m
in the most elementary problem in mechanics: it marks the first step in splitting a second-order equation or equations into twice as many first-order equations.
Now vary the action with respect to the g i j g i j g_(ij)g_{i j}gij and again, after appropriate integration by parts and rearrangement, find the remaining first-order dynamic equations of general relativity [simplified by use of equations (21.116) and (21.117)],
π i j / t = N g 1 / 2 ( R i j 1 2 g i j R ) + 1 2 N g 1 / 2 g i j ( Tr n 2 1 2 ( Tr π ) 2 ) 2 N g 1 / 2 ( π i m π m j 1 2 π i j Tr π ) (21.115) + g 1 / 2 ( N i j g i j N m m ) + ( π i j N m ) m N i m π m j N j m π m i + [ source terms arising from fields other than geometry, omitted here for simplicity, but discussed by ADM (1962) ] i j π i j / t = N g 1 / 2 R i j 1 2 g i j R + 1 2 N g 1 / 2 g i j Tr n 2 1 2 ( Tr π ) 2 2 N g 1 / 2 π i m π m j 1 2 π i j Tr π (21.115) + g 1 / 2 N i j g i j N m m + π i j N m m N i m π m j N j m π m i +  source terms arising from fields   other than geometry, omitted here for   simplicity, but discussed by ADM (1962)  i j {:[delpi^(ij)//del t=-Ng^(1//2)(R^(ij)-(1)/(2)g^(ij)R)+(1)/(2)Ng^(-1//2)g^(ij)(Trn^(2)-(1)/(2)(Tr pi)^(2))],[-2Ng^(-1//2)(pi^(im)pi_(m)^(j)-(1)/(2)pi^(ij)Tr pi)],[(21.115)+g^(1//2)(N^(∣ij)-g^(ij)N^(∣m)_(∣m))+(pi^(ij)N^(m))_(∣m)],[-N^(i)_(∣m)pi^(mj)-N^(j)_(∣m)pi^(mi)+[[" source terms arising from fields "],[" other than geometry, omitted here for "],[" simplicity, but discussed by ADM (1962) "]]^(ij)]:}\begin{align*} \partial \pi^{i j} / \partial t= & -N g^{1 / 2}\left(R^{i j}-\frac{1}{2} g^{i j} R\right)+\frac{1}{2} N g^{-1 / 2} g^{i j}\left(\operatorname{Tr} n^{2}-\frac{1}{2}(\operatorname{Tr} \pi)^{2}\right) \\ & -2 N g^{-1 / 2}\left(\pi^{i m} \pi_{m}{ }^{j}-\frac{1}{2} \pi^{i j} \operatorname{Tr} \pi\right) \\ & +g^{1 / 2}\left(N^{\mid i j}-g^{i j} N^{\mid m}{ }_{\mid m}\right)+\left(\pi^{i j} N^{m}\right)_{\mid m} \tag{21.115}\\ & -N^{i}{ }_{\mid m} \pi^{m j}-N^{j}{ }_{\mid m} \pi^{m i}+\left[\begin{array}{l} \text { source terms arising from fields } \\ \text { other than geometry, omitted here for } \\ \text { simplicity, but discussed by ADM (1962) } \end{array}\right]^{i j} \end{align*}πij/t=Ng1/2(Rij12gijR)+12Ng1/2gij(Trn212(Trπ)2)2Ng1/2(πimπmj12πijTrπ)(21.115)+g1/2(NijgijNmm)+(πijNm)mNimπmjNjmπmi+[ source terms arising from fields  other than geometry, omitted here for  simplicity, but discussed by ADM (1962) ]ij
Finally extremize the action (21.95) with respect to the lapse function N N NNN and the shift functions N i N i N_(i)N_{i}Ni, and find the four so-called initial-value equations of general relativity, equivalent to (21.77) and (21.81) or to G n α = 8 π T n α G n α = 8 π T n α G_(n)^(alpha)=8piT_(n)^(alpha)G_{n}^{\alpha}=8 \pi T_{n}^{\alpha}Gnα=8πTnα; thus,
(21.116) ( 1 / 16 π ) H ( π i j , g i j ) = ( 1 / 8 π ) N g 1 / 2 g i j ( E i E j + B i B j ) , (21.117) ( 1 / 16 π ) H i ( π i j , g i j ) = ( 1 / 4 π ) [ i j k ] E j B k . (21.116) ( 1 / 16 π ) H π i j , g i j = ( 1 / 8 π ) N g 1 / 2 g i j E i E j + B i B j , (21.117) ( 1 / 16 π ) H i π i j , g i j = ( 1 / 4 π ) [ i j k ] E j B k . {:[(21.116)-(1//16 pi)H(pi^(ij),g_(ij))=(1//8pi)Ng^(-1//2)g_(ij)(E^(i)E^(j)+B^(i)B^(j))","],[(21.117)-(1//16 pi)H^(i)(pi^(ij),g_(ij))=-(1//4pi)[ijk]E^(j)B^(k).]:}\begin{gather*} -(1 / 16 \pi) \mathcal{H}\left(\pi^{i j}, g_{i j}\right)=(1 / 8 \pi) N g^{-1 / 2} g_{i j}\left(\mathcal{E}^{i} \mathscr{E}^{j}+\mathscr{B}^{i} \mathscr{B}^{j}\right), \tag{21.116}\\ -(1 / 16 \pi) \mathscr{H}^{i}\left(\pi^{i j}, g_{i j}\right)=-(1 / 4 \pi)[i j k] \mathcal{E}^{j} \mathscr{B}^{k} . \tag{21.117} \end{gather*}(21.116)(1/16π)H(πij,gij)=(1/8π)Ng1/2gij(EiEj+BiBj),(21.117)(1/16π)Hi(πij,gij)=(1/4π)[ijk]EjBk.
Dynamic and initial-value equations out of ADM formalism

EXERCISES

Exercise 21.11. FIRST EXPLOITATION OF THE ADM VARIATIONAL PRINCIPLE FOR THE ELECTROMAGNETIC FIELD

Extremize the action principle (21.100) with respect to the E i E i E^(i)\mathcal{E}^{i}Ei and derive the result (21.103).

Exercise 21.12. SECOND EXPLOITATION OF THE ADM VARIATIONAL PRINCIPLE FOR THE ELECTROMAGNETIC FIELD

Extremize (21.100) with respect to the A i A i A_(i)A_{i}Ai, and verify that the resulting equations in any Minkowski-flat region are equivalent to (21.107).
Exercise 21.13. FARADAY-MAXWELL SOURCE TERM IN THE DYNAMIC EQUATIONS OF GENERAL RELATIVITY
Evaluate the final indicated source terms in (21.115) from the Lagrangian (21.100) of Maxwell electrodynamics, regarded as a function of the A i A i A_(i)A_{i}Ai and the g i j g i j g_(ij)g_{i j}gij.

Exercise 21.14. THE CHOICE OF ϕ ϕ phi\phiϕ DOESN'T MATTER

Prove the statement in the text that the dynamic development of the electric and magnetic fields themselves is independent of the choice made for the scalar potential ϕ ( t , x , y , z ) ϕ ( t , x , y , z ) phi(t,x,y,z)\phi(t, x, y, z)ϕ(t,x,y,z) in the analysis (a) in flat spacetime in Minkowski coordinates and (b) in general relativity, according to equations (21.103), and (21.107) as generalized in exercise 21.12.

Exercise 21.15. THE CHOICE OF SLICING OF SPACETIME DOESN'T MATTER

Given a metric ( 3 ) g i j ( x , y , z ) ( 3 ) g i j ( x , y , z ) ^((3))g_(ij)(x,y,z){ }^{(3)} g_{i j}(x, y, z)(3)gij(x,y,z) and an extrinsic curvature K i j ( x , y , z ) K i j ( x , y , z ) K^(ij)(x,y,z)K^{i j}(x, y, z)Kij(x,y,z) on a spacelike hypersurface Σ Σ Sigma\boldsymbol{\Sigma}Σ, and given that these quantities satisfy the initial-value equations (21.116) and (21.117), and given two alternative choices for the lapse and shift functions ( N , N i ) N , N i (N,N_(i))\left(N, N_{i}\right)(N,Ni) and ( N + δ N ( N + δ N (N+delta N(N+\delta N(N+δN, N i + δ N i N i + δ N i N_(i)+deltaN_(i)N_{i}+\delta N_{i}Ni+δNi ), show that the curvature itself (as distinguished from its components in these two distinct coordinate systems), as calculated at a point P P P\mathscr{P}P a "little way" (first order of small quantities) off the hypersurface, by way of the dynamic equations (21.114) and (21.115), is independent of this choice of lapse and shift.

§21.8. INTEGRATING FORWARD IN TIME

In the Hamiltonian formalism of Arnowitt, Deser, and Misner [see also the many papers by many workers on the quantization of general relativity-primarily putting Einstein's theory into Hamiltonian form-cited, for example, in references 1 and 2 of Wheeler (1968)], the dynamics of geometry takes a form quite similar to the Hamiltonian dynamics of geometry. There one gives x x xxx and p p ppp at a starting time and integrates two first-order equations for d x / d t d x / d t dx//dtd x / d tdx/dt and d p / d t d p / d t dp//dtd p / d tdp/dt ahead in time to find these dynamically conjugate variables at all future times. Here one gives appropriate values of g i j g i j g_(ij)g_{i j}gij and π i j π i j pi^(ij)\pi^{i j}πij over an initial spacelike hypersurface and integrates the two first-order equations (21.114) and (21.115) ahead in time to find the geometry at future times. For example, one can rewrite the differential equations as difference equations according to the practice by now familiar in modern hydrodynamics, and then carry out the integration on an electronic digital computer of substantial memory capacity.
Time in general relativity has a many-fingered quality very different from the one-parameter nature of time in nonrelativistic particle mechanics [see, however, Dirac, Fock, and Podolsky (1932) for a many-time formalism for treating the relativistic dynamics of a system of many interacting particles]. He who is studying the geometry is free to push ahead the spacelike hypersurface faster at one place than another, so long as he keeps it spacelike. This freedom expresses itself in the lapse function N ( t , x , y , z ) N ( t , x , y , z ) N(t,x,y,z)N(t, x, y, z)N(t,x,y,z) at each stage, t t ttt, of the integration. Equations (21.114) and (21.115) are not a conduit to feed out information on N N NNN to the analyst. They are a conduit for the analyst to feed in information on N N NNN. The choice of N N NNN is to be made, not by nature, but by man. The dynamic equations cannot begin to fulfill their purpose until this choice is made. The "time parameter" t t ttt is only a label to distinguish one spacelike hypersurface from another in a one-parameter family of hypersurface; but N N NNN thus tells the spacing in proper time, as it varies from place to place, between the successive slices on which one chooses to record the time-evolution of the geometry. A cinema camera can record what happens only one frame at a time, but the operator can make a great difference in what that camera sees by his choice of angle for the filming of the scene. So here, with the choice of slicing.
Another choice is of concern to the analyst, especially one doing his analysis on a digital computer. He is in the course of determining, via (21.114-21.115) written as difference equations, what happens on a lattice work of points, typified by x = , 73 , 74 , 75 , 76 , 77 , x = , 73 , 74 , 75 , 76 , 77 , x=dots,73,74,75,76,77,dotsx=\ldots, 73,74,75,76,77, \ldotsx=,73,74,75,76,77,, etc. He finds that the curvatures are developing most strongly in a localized region in the range around x = 83 x = 83 x=83x=83x=83 to x = 89 x = 89 x=89x=89x=89. He wants to increase the density of coverage of his tracer points in this region. He does so by causing points at lesser and greater x x xxx values to drift into this region moment by moment as t t ttt increases: t = , 122 , 123 , 124 , t = , 122 , 123 , 124 , t=dots,122,123,124,dotst=\ldots, 122,123,124, \ldotst=,122,123,124,. He makes the tracer points at lesser x x xxx-values start to move to the right ( N 1 N 1 N_(1)N_{1}N1 positive) and points at greater x x xxx-values move to the left ( N 1 N 1 N_(1)N_{1}N1 negative). In other words, the choice of the three shift functions N i ( t , x , y , z ) N i ( t , x , y , z ) N_(i)(t,x,y,z)N_{i}(t, x, y, z)Ni(t,x,y,z) is just as much the responsibility of the analyst as is the choice of the lapse function N N NNN. The equations will never tell him what to pick. He has to tell the equations.
These options, far from complicating dynamic equations (21.114-21.115), make them flexible and responsive to the wishes of the analyst in following the course of whatever geometrodynamic process is in his hands for study.
The freedom that exists in general relativity in the choice of the four functions N , N i N , N i N,N_(i)N, N_{i}N,Ni, is illuminated from another side by comparing it with the freedom one has in electrodynamics to pick the one function ϕ ( t , x , y , z ) ϕ ( t , x , y , z ) phi(t,x,y,z)\phi(t, x, y, z)ϕ(t,x,y,z), the scalar potential. In no way do the dynamic Maxwell equations (21.103) and (21.107), as generalized in exercise 21.12 determine ϕ ϕ phi\phiϕ. Instead they demand that it be determined (by the analyst) as the price for predicting the time-development of the vector potential A i A i A_(i)A_{i}Ai. An altered choice of ϕ ( t , x , y , z ) ϕ ( t , x , y , z ) phi(t,x,y,z)\phi(t, x, y, z)ϕ(t,x,y,z) in its dependence on position and time means altered results from the dynamic equations for the development of the three A i A i A_(i)A_{i}Ai in time and space. However, the physically significant quantities, the electric and magnetic fields themselves on successive hypersurfaces, come out the same (exercise 21.14) regardless of this choice of ϕ ϕ phi\phiϕ. Similarly in geometrodynamics: an altered choice for the four
Lapse and shift chosen to push forward the integration in time as one finds most convenient
Same 4-geometry regardless of lapse and shift options
Figure 21.4.
Some of the many ways to make distinct spacelike slices through one and the same ( 4 ) ( 4 ) ^((4)){ }^{(4)}(4) ?, the complete Schwarzschild 4-geometry.
functions N , N i N , N i N,N_(i)N, N_{i}N,Ni, means (a) an altered laying down of coordinates in spacetime, and therefore (b) altered results for the intrinsic metric ( 3 ) g i j ( 3 ) g i j ^((3))g_(ij){ }^{(3)} g_{i j}(3)gij and extrinsic curvature K i j K i j K^(ij)K^{i j}Kij of successive spacelike hypersurfaces, but yields the same 4 -geometry ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} \mathscr{y}(4)y (Figure 21.4) regardless of this choice of coordinatization (exercise 21.15).

§21.9. THE INITIAL-VALUE PROBLEM IN THE THIN-SANDWICH FORMULATION

Given appropriate initial-value data, one can integrate the dynamic equations ahead in time and determine the evolution of the geometry; but what are "appropriate initial-value data"? They are six functions ( 3 ) g i j ( x , y , z ) ( 3 ) g i j ( x , y , z ) ^((3))g_(ij)(x,y,z){ }^{(3)} g_{i j}(x, y, z)(3)gij(x,y,z) plus six more functions π i j ( x , y , z ) π i j ( x , y , z ) pi^(ij)(x,y,z)\pi^{i j}(x, y, z)πij(x,y,z) or K i j ( x , y , z ) K i j ( x , y , z ) K^(ij)(x,y,z)K^{i j}(x, y, z)Kij(x,y,z) that together satisfy the four initial-value equations (21.116) and (21.117). To be required to give coordinates and momenta accords with the familiar plan of Hamiltonian mechanics; but to have consistency conditions or "constraints" imposed on such data is less familiar. A particle moving in two-dimensional space is catalogued by coordinates x , y x , y x,yx, yx,y, and coordinates p x , p y p x , p y p_(x),p_(y)p_{x}, p_{y}px,py; but a particle forced to remain on the circle x 2 + y 2 = a 2 x 2 + y 2 = a 2 x^(2)+y^(2)=a^(2)x^{2}+y^{2}=a^{2}x2+y2=a2 satisfies the constraint x p x + y p y = 0 x p x + y p y = 0 xp_(x)+yp_(y)=0x p_{x}+y p_{y}=0xpx+ypy=0. Thus the existence of a "constraint" is a signal that the system possesses fewer degrees
of freedom than one would otherwise suppose. Fully to analyze the four "initialvalue" or "constraint" conditions (21.116) and (21.117) is thus to determine (1) how many dynamic degrees of freedom the geometry possesses and (2) what these degrees of freedom are; that is to say, precisely what "handles" one can freely adjust to govern completely the geometry and its evolution with time. The counting one can do today, with the conclusion that the geometry possesses the same count of true degrees of freedom as the electromagnetic field. The identification of the "handles," or freely adjustable features of the dynamics, is less advanced for geometry than it is for electromagnetism (Box 21.2), but most instructive so far as it goes.
By rights the identification of the degrees of freedom of the field, whether that of Einstein or that of Faraday and Maxwell, requires nothing more than knowing what must be fixed on initial and final spacelike hypersurfaces to make the appropriate variation principle well-defined. One then has the option whether (1) to give that quantity on both hypersurfaces or (2) to give that quantity and its dynamic conjugate on one hypersurface or (3) to give the quantity on both hypersurfaces, as in (1), but go to the limit of an infinitely thin sandwich, so that one ends up specifying the quantity and its time rate of change on one hypersurface. This third "thin sandwich" procedure is simplest for a quick analysis of the initial-value problem in both electrodynamics and geometrodynamics. Take electrodynamics first, as an illustration.
Give the divergence-free magnetic field and its time-rate of change: on an arbitrary smooth spacelike hypersurface in curved spacetime in the general case; on the hypersurface t = 0 t = 0 t=0t=0t=0 in Minkowski spacetime in the present illustrative treatment,
(21.118) B i ( 0 , x , y , z ) given, (21.119) B ˙ i ( 0 , x , y , z ) = ( B i t ) also given. (21.118) B i ( 0 , x , y , z )  given,  (21.119) B ˙ i ( 0 , x , y , z ) = B i t  also given.  {:[(21.118)B^(i)(0","x","y","z)" given, "],[(21.119)B^(˙)^(i)(0","x","y","z)=((delB^(i))/(del t))" also given. "]:}\begin{gather*} \mathscr{B}^{i}(0, x, y, z) \text { given, } \tag{21.118}\\ \dot{\mathscr{B}}^{i}(0, x, y, z)=\left(\frac{\partial \mathscr{B}^{i}}{\partial t}\right) \text { also given. } \tag{21.119} \end{gather*}(21.118)Bi(0,x,y,z) given, (21.119)B˙i(0,x,y,z)=(Bit) also given. 
These quantities together contain four and only four independent data per space point. How is one now to obtain the momenta π i E i π i E i pi^(i)∼-E^(i)\pi^{i} \sim-\mathcal{E}^{i}πiEi so that one can start integrating the dynamic equations (21.103) and (21.107) forward in time? (1) Find a set of three functions A i ( 0 , x , y , z ) A i ( 0 , x , y , z ) A_(i)(0,x,y,z)A_{i}(0, x, y, z)Ai(0,x,y,z) such that their curl gives the three specified B i B i B^(i)\mathscr{B}^{i}Bi. That this can be done at all is guaranteed by the vanishing of the divergence B i , i B i , i B^(i)_(,i)\mathscr{B}^{i}{ }_{, i}Bi,i. However, the choice of the A i A i A_(i)A_{i}Ai is not unique. The new set of potentials A inew = A inew  = A_("inew ")=A_{\text {inew }}=Ainew = A i + λ / x i A i + λ / x i A_(i)+del lambda//delx^(i)A_{i}+\partial \lambda / \partial x^{i}Ai+λ/xi with arbitrary smooth λ λ lambda\lambdaλ, provide just as good a solution as the original A i A i A_(i)A_{i}Ai. No matter. Pick one solution and stick to it. (2) Similarly, find a set of three A ˙ i ( 0 , x , y , z ) A ˙ i ( 0 , x , y , z ) A^(˙)_(i)(0,x,y,z)\dot{A}_{i}(0, x, y, z)A˙i(0,x,y,z) such that their curl gives the specified G ˙ i ( 0 , x , y , z ) G ˙ i ( 0 , x , y , z ) G^(˙)^(i)(0,x,y,z)\dot{\mathscr{G}}^{i}(0, x, y, z)G˙i(0,x,y,z), and resolve all arbitrariness of choice by fiat. (3) Recall that the electric field (negative of the field momentum) is given by
(21.120) E i = A ˙ i ϕ / x i (21.120) E i = A ˙ i ϕ / x i {:(21.120)E_(i)=-A^(˙)_(i)-del phi//delx^(i):}\begin{equation*} \mathcal{E}_{i}=-\dot{A}_{i}-\partial \phi / \partial x^{i} \tag{21.120} \end{equation*}(21.120)Ei=A˙iϕ/xi
(formula valid without amendment only in flat space). The initial-value or constraint equation E i , i = 0 E i , i = 0 E^(i)_(,i)=0\mathcal{E}^{i}{ }_{, i}=0Ei,i=0 translates to the form
(21.121) 2 ϕ = η i j A ˙ i , j (21.121) 2 ϕ = η i j A ˙ i , j {:(21.121)grad^(2)phi=-eta^(ij)A^(˙)_(i,j):}\begin{equation*} \nabla^{2} \phi=-\eta^{i j} \dot{A}_{i, j} \tag{21.121} \end{equation*}(21.121)2ϕ=ηijA˙i,j
In electromagnetism, give magnetic field and its rate of change as initial data

Box 21.2 COUNTING THE DEGREES OF FREEDOM OF THE ELECTROMAGNETIC FIELD

A. First Approach: Number of "Field Coordinates" per Spacepoint

Superficial tally of the degrees of freedom of the source-free electromagnetic field gives three field coordinates A i ( x , y , z ) A i ( x , y , z ) A_(i)(x,y,z)A_{i}(x, y, z)Ai(x,y,z) per spacepoint on the initial simultaneity Σ Σ Sigma\SigmaΣ, plus three field momenta π true i = π true  i = pi_("true ")^(i)=\pi_{\text {true }}^{i}=πtrue i= π i / 4 π [ π i / 4 π pi^(i)//4pi[:}\pi^{i} / 4 \pi\left[\right.πi/4π[ with π i = E i ( x , y , z ) ] π i = E i ( x , y , z ) {:pi^(i)=-E^(i)(x,y,z)]\left.\pi^{i}=-\mathcal{E}^{i}(x, y, z)\right]πi=Ei(x,y,z)] per spacepoint.
Closer inspection reveals that the number of coordinate degrees of freedom per spacepoint is not three but two. Thus the change in vector potential A i A i + λ / x i A i A i + λ / x i A_(i)longrightarrowA_(i)+del lambda//delx^(i)A_{i} \longrightarrow A_{i}+\partial \lambda / \partial x^{i}AiAi+λ/xi makes no change in the actual physics, the magnetic field components,
B i = 1 2 [ i j k ] ( A k / x j A j / x k ) . B i = 1 2 [ i j k ] A k / x j A j / x k . B^(i)=(1)/(2)[ijk](delA_(k)//delx^(j)-delA_(j)//delx^(k)).B^{i}=\frac{1}{2}[i j k]\left(\partial A_{k} / \partial x^{j}-\partial A_{j} / \partial x^{k}\right) .Bi=12[ijk](Ak/xjAj/xk).
Moreover, though those components are three in number, they satisfy one condition per spacepoint, B i , i = 0 B i , i = 0 B^(i)_(,i)=0\mathscr{B}^{i}{ }_{, i}=0Bi,i=0, thus reducing the effective net number of coordinate degrees of freedom per spacepoint to two.
The momentum degrees of freedom per spacepoint are likewise reduced from three to two by the one condition per spacepoint E i , i = 0 E i , i = 0 E^(i)_(,i)=0\mathcal{E}^{i}{ }_{, i}=0Ei,i=0.

B. Alternative Approach: Count Fourier Coefficients

In textbooks on field theory [see, for example, Wentzel (1949)], attention focuses on flat spacetime. The electromagnetic field is decomposed by Fourier analysis into individual running waves. Instead of counting degrees of freedom per point in coordinate space, one does the equivalent: counts up degrees of freedom per point in wavenumber space. Thus for each ( k x , k y , k z ) k x , k y , k z (k_(x),k_(y),k_(z))\left(k_{x}, k_{y}, k_{z}\right)(kx,ky,kz), there are two independent states of polarization. Each state of polarization requires for its description an amplitude ("coordinate") and time-rate of change of amplitude ("momentum") at the initial time, t 0 t 0 t_(0)^(')t_{0}^{\prime}t0. Thus the number of degrees of freedom per point in wave-number space is two for coordinates and two for momenta, in accord with what one gets by carrying out the count in coordinate space.
In curved spacetime, Fourier analysis is a less convenient way of identifying the degrees of freedom of the electromagnetic field [for such a Fourier analysis, see Misner and Wheeler (1957), especially their Table X and following text] than direct analysis in space, as above.

C. Another Alternative: Analyze "Deformation of Structure"

Still a third way to get a handle on the degrees of freedom of a divergence-free field, whether E E E\mathcal{E}E or B B B\mathscr{B}B, rests on the idea of deformation of structure [diagram from Wheeler (1964)]. Represent the

magnetic field by Faraday's picture of lines of force (a) continuing through space without ever ending, automatic guarantee that B i , i B i , i B^(i)_(,i)\mathscr{B}^{i}{ }_{, i}Bi,i is everywhere zero. Insert "knitting needles" (b) into the spaghetti-like structure of the lines of force and move these needles as one will. Sliding the "knitting needles" along a line of force causes no movement of the line of force. (c) With the help of two knitting needles perpendicular to each other and to the line of force, one can give any given line of force any small displacement one pleases perpendicular to its length: again two degrees of freedom per spacepoint. Granted any non-zero field to begin with, no matter how small, one can build it up by a sequence of such small deformations to agree with any arbitrary field pattern of zero divergence, no matter what its complexity and strength may be.
Solve for ϕ ϕ phi\phiϕ. Then (4) equation (21.120) gives the initial-value electric field, or electrodynamic field momentum π i E i π i E i pi^(i)∼-E^(i)\pi^{i} \sim-\mathcal{E}^{i}πiEi, required (along with the field coordinate A i A i A_(i)A_{i}Ai ) for starting the integration of the dynamic equations (21.103) and (21.107). [Misner and Wheeler (1957) deal with the additional features that come in when the space is multiply connected. Each wormhole or handle of the geometry is able to trap electric lines of force. The flux trapped in any one wormhole defines the classical electric charge q w q w q_(w)q_{w}qw associated with that wormhole. One has to specify all these charges once and for all in addition to the data (21.118) and (21.119) in order to determine fully the dynamic evolution of the electromagnetic field. There is no geometrodynamic analog to electric charge, according to Unruh (1971).] (5) In this integration, the scalar potential ϕ ϕ phi\phiϕ at each subsequent time step is not to be calculated; it is to be chosen. Only when one has made this free choice definite do the dynamic equations come out with definite results for the A i A i A_(i)A_{i}Ai and the π i π i pi^(i)\pi^{i}πi or E i E i E^(i)\mathscr{E}^{i}Ei at these successive steps.
In the thin-sandwich formulation of the initial-value problem of electrodynamics, to summarize, one gives B i B i B^(i)\mathscr{B}^{i}Bi and B ˙ i B ˙ i B^(˙)^(i)\dot{\mathscr{B}}^{i}B˙i (equivalent to B B B\mathscr{B}B on two nearby hypersurfaces). One chooses the A i A i A_(i)A_{i}Ai and A ˙ i A ˙ i A^(˙)_(i)\dot{A}_{i}A˙i with much arbitrariness to represent these initial-value data. The arbitrariness having been seized on to give the initial A i A i A_(i)A_{i}Ai and A i A i A_(i)A_{i}Ai, there is no arbitrariness left in the initial ϕ ϕ phi\phiϕ. However, at all subsequent times the situation is just the other way around. All the arbitrariness is sopped up in the choice of the ϕ ϕ phi\phiϕ, leaving no arbitrariness whatever in the three A i A i A_(i)A_{i}Ai (as given by the integration of the dynamic equation).
The situation is quite similar in geometrodynamics. One gives the beginnings of a 1-parameter family of spacelike hypersurfaces; namely,
(21.122) (3) g ( 0 ) given, (21.123) (3) y ˙ ( 0 ) = ( 3 ) 0 t given, (21.122)  (3)  g ( 0 )  given,  (21.123)  (3)  y ˙ ( 0 ) = ( 3 ) 0 t  given,  {:[(21.122)" (3) "g(0)" given, "],[(21.123)" (3) "y^(˙)(0)=(del^((3))ℓ_(0))/(del t)" given, "]:}\begin{gather*} \text { (3) } \mathscr{g}(0) \text { given, } \tag{21.122}\\ \text { (3) } \dot{y}(0)=\frac{\partial^{(3)} \ell_{0}}{\partial t} \text { given, } \tag{21.123} \end{gather*}(21.122) (3) g(0) given, (21.123) (3) y˙(0)=(3)0t given, 
Then (1) one picks a definite set of coordinates x i = ( x , y , z ) x i = ( x , y , z ) x^(i)=(x,y,z)x^{i}=(x, y, z)xi=(x,y,z) and in terms of those coordinates finds the unique metric coefficients g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)(x,y,z)g_{i j}(x, y, z)gij(x,y,z) that describe that 3-geometry. The existence of a solution is guaranteed by the circumstance that ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y is a Riemannian geometry. However, one could have started with different coordinates and ended up with different metric coefficients for the description of the same 3 -geometry. No matter. Pick one set of coordinates, take the resulting metric coefficients, and stick to them as giving half the required initial-value data. (2) Similarly, to describe the 3 -geometry ( 3 ) y + ( 3 ) y ˙ d t ( 3 ) y + ( 3 ) y ˙ d t ^((3))y+^((3))y^(˙)dt{ }^{(3)} y+{ }^{(3)} \dot{y} d t(3)y+(3)y˙dt at the value of the parameter t + d t t + d t t+dtt+d tt+dt, make use of coordinates x i + x ˙ i d t x i + x ˙ i d t x^(i)+x^(˙)^(i)dtx^{i}+\dot{x}^{i} d txi+x˙idt and arrive at the metric coefficients g i j + g ˙ i j d t g i j + g ˙ i j d t g_(ij)+g^(˙)_(ij)dtg_{i j}+\dot{g}_{i j} d tgij+g˙ijdt. The arbitrariness in the x i x i x^(i)x^{i}xi having thus been resolved by fiat, and the ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y being given as definite initial physical data, the g i j g i j g_(ij)g_{i j}gij are thereby completely fixed. (3) Recall that the components of the extrinsic curvature K i j K i j K_(ij)K_{i j}Kij or the momenta π i j π i j pi^(ij)\pi^{i j}πij are given in terms of the g i j g i j g_(ij)g_{i j}gij and g ˙ i j g ˙ i j g^(˙)_(ij)\dot{g}_{i j}g˙ij and the lapse and shift functions N N NNN and N i N i N_(i)N_{i}Ni by (21.67) or by (21.67) plus (21.91) or by (21.114). The four initial-value or "constraint" equations (21.116) and (21.117) thus become four conditions for finding the four
Scalar potential: fixed at start; freely disposable later
In ADM treatment, give 3 -geometry and its time-rate of change
quantities N , N i N , N i N,N_(i)N, N_{i}N,Ni. One can shorten the writing of these conditions by introducing the abbreviations
(21.124) γ i j = 1 2 [ N i j + N j i g i j / t ] (21.124) γ i j = 1 2 N i j + N j i g i j / t {:(21.124)gamma_(ij)=(1)/(2)[N_(i∣j)+N_(j∣i)-delg_(ij)//del t]:}\begin{equation*} \gamma_{i j}=\frac{1}{2}\left[N_{i \mid j}+N_{j \mid i}-\partial g_{i j} / \partial t\right] \tag{21.124} \end{equation*}(21.124)γij=12[Nij+Njigij/t]
and
(21.125) γ 2 = ( "shift anomaly" " ) = ( Tr γ ) 2 Tr γ 2 (21.125) γ 2 = (  "shift   anomaly" "  ) = ( Tr γ ) 2 Tr γ 2 {:(21.125)gamma_(2)=((" "shift ")/(" anomaly" " "))=(Tr gamma)^(2)-Trgamma^(2):}\begin{equation*} \gamma_{2}=\binom{\text { "shift }}{\text { anomaly" " }}=(\operatorname{Tr} \boldsymbol{\gamma})^{2}-\operatorname{Tr} \boldsymbol{\gamma}^{2} \tag{21.125} \end{equation*}(21.125)γ2=( "shift  anomaly" " )=(Trγ)2Trγ2
(both for functions of x , y , z x , y , z x,y,zx, y, zx,y,z on the initial simultaneity). Then one has
(21.126) ( 3 ) R + γ 2 / N 2 = 16 π T n n = 16 π T n n (21.126) ( 3 ) R + γ 2 / N 2 = 16 π T n n = 16 π T n n {:(21.126)^((3))R+gamma_(2)//N^(2)=16 piT_(nn)=16 piT^(nn):}\begin{equation*} { }^{(3)} R+\gamma_{2} / N^{2}=16 \pi T_{n n}=16 \pi T^{n n} \tag{21.126} \end{equation*}(21.126)(3)R+γ2/N2=16πTnn=16πTnn
for the one initial-value equation; and for the other three,
(21.127) [ γ i k δ i k Tr γ N ] k = 8 π T i n (21.127) γ i k δ i k Tr γ N k = 8 π T i n {:(21.127)[(gamma_(i)^(k)-delta_(i)^(k)Tr gamma)/(N)]_(∣k)=-8piT_(i)^(n):}\begin{equation*} \left[\frac{\gamma_{i}{ }^{k}-\delta_{i}{ }^{k} \operatorname{Tr} \gamma}{N}\right]_{\mid k}=-8 \pi T_{i}{ }^{n} \tag{21.127} \end{equation*}(21.127)[γikδikTrγN]k=8πTin
In summary, one chooses the g i j g i j g_(ij)g_{i j}gij and g ˙ i j g ˙ i j g^(˙)_(ij)\dot{g}_{i j}g˙ij with much arbitrariness (because of the arbitrariness in the coordinates, not by reason of any arbitrariness in the physics)
Lapse and shift initially determinate; thereafter freely disposable
Counting initial-value data
to represent the given initial-value data, ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y and ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) g. The arbitrariness at the initial time all having been soaked up in this way, one expects no arbitrariness to be left in the initial N N NNN and N i N i N_(i)N_{i}Ni as obtained by solving (21.126) and (21.127). However, on all later spacelike slices, the award of the arbitrariness is reversed. The lapse and shift functions are freely disposable, but, with them once chosen, there is no arbitrariness whatever in the six g i j g i j g_(ij)g_{i j}gij (and the six K i j K i j K^(ij)K^{i j}Kij or π i j π i j pi^(ij)\pi^{i j}πij ) as given by the integration of the dynamic equations (21.114) and (21.115). The analogy with electrodynamics is clear. There the one "gauge-controlled" function ϕ ϕ phi\phiϕ was fixed at the start by the elliptic equation (21.121), but was thereafter free. Here the four lapse and shift functions are fixed at the start by the four equations (21.126) and (21.127), but are thereafter free.
Exercise 21.16 applies the initial-value equation (21.126) to analyze the whole evolution in time of any Friedmann universe in which one knows the equation p = p ( ρ ) p = p ( ρ ) p=p(rho)p=p(\rho)p=p(ρ) connecting pressure with density. Exercise 21.17 looks for a variation principle on the spacelike hypersurface Σ Σ Sigma\SigmaΣ equivalent in content to the elliptic initial-value equation (21.121) for the scalar potential ϕ ϕ phi\phiϕ. Exercises 21.18 and 21.19 look for similar variation principles to determine the lapse and shift functions.
How many degrees of freedom, or how many "handles," are there in the specification of the 4 -geometry that one will obtain? The metric coefficients of the initial 3 -geometry provided six numbers per space point. However, they were arbitrary to the extent of a coordinate transformation, specified by three functions of position,
x = x ( x , y , z ) , y = y ( x , y , z ) , z = z ( x , y , z ) . x = x x , y , z , y = y x , y , z , z = z x , y , z . {:[x=x(x^('),y^('),z^('))","],[y=y(x^('),y^('),z^('))","],[z=z(x^('),y^('),z^(')).]:}\begin{aligned} & x=x\left(x^{\prime}, y^{\prime}, z^{\prime}\right), \\ & y=y\left(x^{\prime}, y^{\prime}, z^{\prime}\right), \\ & z=z\left(x^{\prime}, y^{\prime}, z^{\prime}\right) . \end{aligned}x=x(x,y,z),y=y(x,y,z),z=z(x,y,z).
The net number of quantities per space point with any physical information was therefore 6 3 = 3 6 3 = 3 6-3=36-3=363=3. One can visualize these three functions as the three diagonal components of the metric in a coordinate system in which g i j g i j g_(ij)g_{i j}gij has been transformed to diagonal form. Ordinarily it is not useful to go further and actually spell out the analysis in any such narrowly circumscribed coordinate system.
Now think of the ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y in question as imbedded in the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y that comes out of the integrations. Moreover, think of that ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y as endowed with the lumps, bumps, wiggles, and waves that distinguish it from other generic 4 -geometries and that make Minkowski geometry and special cosmologies so unrepresentative. The ( 3 ) y ( s ) ( 3 ) y ( s ) ^((3))y^((s)){ }^{(3)} y^{(s)}(3)y(s) is slice in that ( 4 ) ( 4 ) ^((4)){ }^{(4)}(4) ). It partakes of the lumps, bumps, wiggles, and waves present in all those regions of the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y that it intersects. To the extent that the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y is generic, it does not allow the ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y to be moved to another location without becoming a different ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y. If one tries to push the ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) "forward in time" a little in a certain locality, leaving it unchanged in location elsewhere, one necessarily changes the ( 3 ) ( 3 ) ^((3))ℓ{ }^{(3)} \ell(3). By this circumstance, one sees that the ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) y "carries information about time" [Sharp (1960); Baierlein, Sharp, and Wheeler (1962)]. Moreover, this "forward motion in time" demands for its description one number per space point. It is possible to think of this number in concrete terms by imagining an arbitrary coordinate system t ¯ , x ¯ , y ¯ , z ¯ t ¯ , x ¯ , y ¯ , z ¯ bar(t), bar(x), bar(y), bar(z)\bar{t}, \bar{x}, \bar{y}, \bar{z}t¯,x¯,y¯,z¯ laid down in the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y. Then the hypersurface can be conceived as defined by the value t ¯ = t ¯ = bar(t)=\bar{t}=t¯= t ¯ ( x ¯ , y ¯ , z ¯ ) t ¯ ( x ¯ , y ¯ , z ¯ ) bar(t)( bar(x), bar(y), bar(z))\bar{t}(\bar{x}, \bar{y}, \bar{z})t¯(x¯,y¯,z¯) at which it cuts the typical line x ¯ , y ¯ , z ¯ x ¯ , y ¯ , z ¯ bar(x), bar(y), bar(z)\bar{x}, \bar{y}, \bar{z}x¯,y¯,z¯. A forward movement carries it to t ¯ ( x ¯ , y ¯ , z ¯ ) + δ t ¯ ( x ¯ , y ¯ , z ¯ ) t ¯ ( x ¯ , y ¯ , z ¯ ) + δ t ¯ ( x ¯ , y ¯ , z ¯ ) bar(t)( bar(x), bar(y), bar(z))+delta bar(t)( bar(x), bar(y), bar(z))\bar{t}(\bar{x}, \bar{y}, \bar{z})+\delta \bar{t}(\bar{x}, \bar{y}, \bar{z})t¯(x¯,y¯,z¯)+δt¯(x¯,y¯,z¯), and changes shape and metric coefficients on ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) ) accordingly. It is usually better not to tie one's thinking down to such a concrete model, but rather to recognize as a general point of principle (1) that the location of the ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) y in spacetime demands for its specification one datum per spacepoint, and (2) that this datum is already willy-nilly present in the three data per spacepoint that mark any ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3).
In conclusion, there are only two data per spacepoint in a ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) 多 that really tell anything about the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y in which it is imbedded, or to be imbedded (as distinguished from where the ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y slices through that ( 4 ) y ) ( 4 ) y {:^((4))y)\left.{ }^{(4)} y\right)(4)y). Similarly for the other ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y that defines the other "face of the sandwich," whether thick or thin. Thus one concludes that the specification of ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y and ( 3 ) y ˙ ( 3 ) y ˙ ^((3))y^(˙){ }^{(3)} \dot{y}(3)y˙ actually gives four net pieces of dynamic information per spacepoint about the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y (all the rest of the information being "many-fingered time," telling where the 3 -geometries are located in that ( 4 ) ) ( 4 ) {:^((4))ℓ)\left.{ }^{(4)} \ell\right)(4)). According to this line of reasoning, geometrodynamics has the same number of dynamic degrees of freedom as electrodynamics. One arrives at the same conclusion in quite another way through the weak-field analysis (§35.3) of gravitational waves on a flat spacetime background: the same ranges of possible wave numbers as for Maxwell waves; and for each wave number two states of polarization; and for each polarization one amplitude and one phase (the equivalent of one coordinate and one momentum).
In electrodynamics in a prescribed spacetime manifold, one has a clean separation between the one time-datum per spacepoint (when one deals with electromagnetism in the context of many-fingered time) and the two dynamic variables per spacepoint; but not so in the superspace formulation of geometrodynamics. There the two kinds of quantities are inextricably mixed together in the one concept of 3-geometry.
Four pieces of geometrodynamic information per space point on initial simultaneity
Turn from initial- and final-value data to the action integral that is determined by (1) these data and (2) the principle that the action be an extremum,
I = I extremum = S . I = I extremum  = S . I=I_("extremum ")=S.I=I_{\text {extremum }}=S .I=Iextremum =S.
The action depends on the variables on the final hypersurface, according to the formula
(21.128) S = S ( Σ , B ) (21.128) S = S ( Σ , B ) {:(21.128)S=S(Sigma","B):}\begin{equation*} S=S(\Sigma, \boldsymbol{B}) \tag{21.128} \end{equation*}(21.128)S=S(Σ,B)
in electrodynamics, but according to the formula
(21.129) S = S ( ( 3 ) y ) (21.129) S = S ( 3 ) y {:(21.129)S=S(^((3))y):}\begin{equation*} S=S\left({ }^{(3)} y\right) \tag{21.129} \end{equation*}(21.129)S=S((3)y)
in geometrodynamics. In each case, there are three numbers per spacepoint in the argument of the functional (one in Σ Σ Sigma\SigmaΣ; two in a divergence-free magnetic field; three in ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) ) ).
This mixing of the one many-fingered time and the two dynamic variables in a 3-geometry makes it harder in general relativity than in Maxwell theory to know when one has in hand appropriate initial value data. Give Σ Σ Sigma\SigmaΣ and give B B B\mathscr{B}B and B B B\mathscr{B}B on Σ Σ Sigma\SigmaΣ : that was enough for electrodynamics. For geometrodynamics, to give the six g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)(x,y,z)g_{i j}(x, y, z)gij(x,y,z) and the six g ˙ i j ( x , y , z ) g ˙ i j ( x , y , z ) g^(˙)_(ij)(x,y,z)\dot{g}_{i j}(x, y, z)g˙ij(x,y,z) is not necessarily enough. For example, let the time parameter t t ttt be a fake, so that d t d t dtd tdt, instead of leading forward from a given hypersurface Σ Σ Sigma\SigmaΣ to a new hypersurface Σ + d Σ Σ + d Σ Sigma+d Sigma\Sigma+d \SigmaΣ+dΣ, merely recoordinatizes the present hypersurface:
x i x i ξ i d t , (21.130) g i j g i j + ( ξ i j j + ξ j i ) d t . x i x i ξ i d t , (21.130) g i j g i j + ξ i j j + ξ j i d t . {:[x^(i)longrightarrowx^(i)-xi^(i)dt","],[(21.130)g_(ij)longrightarrowg_(ij)+(xi_(ijj)+xi_(j∣i))dt.]:}\begin{gather*} x^{i} \longrightarrow x^{i}-\xi^{i} d t, \\ g_{i j} \longrightarrow g_{i j}+\left(\xi_{i j j}+\xi_{j \mid i}\right) d t . \tag{21.130} \end{gather*}xixiξidt,(21.130)gijgij+(ξijj+ξji)dt.
A first inspection may make one think that one has adequate data in the six g i j g i j g_(ij)g_{i j}gij and the six
(21.131) g ˙ i j = ξ i j + ξ j i , (21.131) g ˙ i j = ξ i j + ξ j i , {:(21.131)g^(˙)_(ij)=xi_(i∣j)+xi_(j∣i)",":}\begin{equation*} \dot{g}_{i j}=\xi_{i \mid j}+\xi_{j \mid i}, \tag{21.131} \end{equation*}(21.131)g˙ij=ξij+ξji,
but in the end one sees that one has not both faces of the thin sandwich, as required, but only one. Thus one must reject, as improperly posed data in the generic problem of dynamics, any set of six g ˙ i j g ˙ i j g^(˙)_(ij)\dot{g}_{i j}g˙ij that let themselves be expressed in the form (21.131) [Belasco and Ohanian (1969)].
Similar difficulties occur when the two faces of the thin sandwich, instead of coinciding everywhere, coincide in a limited region, be it three-dimensional, twodimensional, or even one-dimensional ("crossover of one face from being earlier than the other to being later"). Thus it is enough to have (21.131) obtaining even on only a curved line in Σ Σ Sigma\SigmaΣ to reject the six g i j g i j g_(ij)g_{i j}gij as inappropriate initial-value data.
That one can impose conditions on the g i j g i j g_(ij)g_{i j}gij and g ˙ i j g ˙ i j g^(˙)_(ij)\dot{g}_{i j}g˙ij which will guarantee existence and uniqueness of the solution N ( x , y , z ) , N i ( x , y , z ) N ( x , y , z ) , N i ( x , y , z ) N(x,y,z),N_(i)(x,y,z)N(x, y, z), N_{i}(x, y, z)N(x,y,z),Ni(x,y,z) of the initial-value equations (21.126) and (21.127) is known as the "thin-sandwich conjecture," a topic on which there has been much work by many investigators, but so far no decisive theorem.
To presuppose existence and uniqueness is to make the first step in giving mathematical content to Mach's principle that the distribution of mass-energy throughout space determines inertia ($21.12).

§21.10. THE TIME-SYMMETRIC AND TIME-ANTISYMMETRIC INITIAL-VALUE PROBLEMS

Turn from the general initial-value problem to two special initial-value problems that lend themselves to detailed treatment, one known as the time-symmetric ini-tial-value problem, the other as the time-antisymmetric problem.
A 4-geometry is said to be time-symmetric when there exists a spacelike hypersurface Σ Σ Sigma\SigmaΣ at all points of which the extrinsic curvature vanishes. In this case the three initial value equations (21.127) are automatically satisfied, and the fourth reduces to a simple requirement on the three-dimensional scalar curvature invariant,
(21.132) R = 16 π ρ . (21.132) R = 16 π ρ . {:(21.132)R=16 pi rho.:}\begin{equation*} R=16 \pi \rho . \tag{21.132} \end{equation*}(21.132)R=16πρ.
Still further simplifications result when one limits attention to empty space. Simplest of all is the case of spherical symmetry in which (21.132) yields at once the full Schwarzschild geometry at the moment of time symmetry (two asymptotically flat spaces connected by a throat), as developed in exercise 21.20.
Consider a 3 -geometry with metric
(21.133) d s 1 2 = g ( 1 ) i k d x i d x k (21.133) d s 1 2 = g ( 1 ) i k d x i d x k {:(21.133)ds_(1)^(2)=g_((1)ik)dx^(i)dx^(k):}\begin{equation*} d s_{1}^{2}=g_{(1) i k} d x^{i} d x^{k} \tag{21.133} \end{equation*}(21.133)ds12=g(1)ikdxidxk
Call it a "base metric." Consider another 3-geometry with metric
(21.134) d s 2 2 = ψ 4 ( x i ) d s 1 2 (21.134) d s 2 2 = ψ 4 x i d s 1 2 {:(21.134)ds_(2)^(2)=psi^(4)(x^(i))ds_(1)^(2):}\begin{equation*} d s_{2}^{2}=\psi^{4}\left(x^{i}\right) d s_{1}^{2} \tag{21.134} \end{equation*}(21.134)ds22=ψ4(xi)ds12
Angles are identical in the two geometries. On this account they are said to be conformally equivalent. The scalar curvature invariants of the two 3-geometries are related by the formula [Eisenhart (1926)]
(21.135) R 2 = 8 ψ 5 1 2 ψ + ψ 4 R 1 , (21.135) R 2 = 8 ψ 5 1 2 ψ + ψ 4 R 1 , {:(21.135)R_(2)=-8psi^(-5)grad_(1)^(2)psi+psi^(-4)R_(1)",":}\begin{equation*} R_{2}=-8 \psi^{-5} \nabla_{1}^{2} \psi+\psi^{-4} R_{1}, \tag{21.135} \end{equation*}(21.135)R2=8ψ512ψ+ψ4R1,
where
(21.136) 1 2 ψ = ψ i i = g 1 1 / 2 ( / x i ) [ g 1 1 / 2 g i k ( ψ / x k ) ] (21.136) 1 2 ψ = ψ i i = g 1 1 / 2 / x i g 1 1 / 2 g i k ψ / x k {:(21.136)grad_(1)^(2)psi=psi_(∣i)^(∣i)=g_(1)^(-1//2)(del//delx^(i))[g_(1)^(1//2)g^(ik)(del psi//delx^(k))]:}\begin{equation*} \boldsymbol{\nabla}_{1}^{2} \psi=\psi_{\mid i}^{\mid i}=g_{1}^{-1 / 2}\left(\partial / \partial x^{i}\right)\left[g_{1}^{1 / 2} g^{i k}\left(\partial \psi / \partial x^{k}\right)\right] \tag{21.136} \end{equation*}(21.136)12ψ=ψii=g11/2(/xi)[g11/2gik(ψ/xk)]
Demand that the scalar curvature invariant R 2 R 2 R_(2)R_{2}R2 vanish, and arrive [Brill (1959)] at the "wave equation"
(21.137) 1 2 ψ ( R 1 / 8 ) ψ = 0 (21.137) 1 2 ψ R 1 / 8 ψ = 0 {:(21.137)grad_(1)^(2)psi-(R_(1)//8)psi=0:}\begin{equation*} \boldsymbol{\nabla}_{1}{ }^{2} \psi-\left(R_{1} / 8\right) \psi=0 \tag{21.137} \end{equation*}(21.137)12ψ(R1/8)ψ=0
for the conformal correction factor ψ ψ psi\psiψ. Brill takes the base metric to have the form suggested by Bondi,
(21.138) d s 1 2 = e 2 A q 1 ( ρ , z ) ( d z 2 + d ρ 2 ) + ρ 2 d ϕ 2 (21.138) d s 1 2 = e 2 A q 1 ( ρ , z ) d z 2 + d ρ 2 + ρ 2 d ϕ 2 {:(21.138)ds_(1)^(2)=e^(2Aq_(1)(rho,z))(dz^(2)+drho^(2))+rho^(2)dphi^(2):}\begin{equation*} d s_{1}{ }^{2}=e^{2 A q_{1}(\rho, z)}\left(d z^{2}+d \rho^{2}\right)+\rho^{2} d \phi^{2} \tag{21.138} \end{equation*}(21.138)ds12=e2Aq1(ρ,z)(dz2+dρ2)+ρ2dϕ2
and takes the conformal correction factor ψ ψ psi\psiψ also to possess axial symmetry. In the application:
q 1 ( ρ , z ) q 1 ( ρ , z ) q_(1)(rho,z)q_{1}(\rho, z)q1(ρ,z) measures the "distribution of gravitational wave amplitude," assumed for simplicity to vanish outside r = ( ρ 2 + z 2 ) 1 / 2 = a r = ρ 2 + z 2 1 / 2 = a r=(rho^(2)+z^(2))^(1//2)=ar=\left(\rho^{2}+z^{2}\right)^{1 / 2}=ar=(ρ2+z2)1/2=a;
A measures the "amplitude of the distribution of gravitational wave amplitude";
ψ ( ρ , z ) ψ ( ρ , z ) psi(rho,z)\psi(\rho, z)ψ(ρ,z) is the conformal correction factor, which varies with position at large distances as 1 + ( m / 2 r ) 1 + ( m / 2 r ) 1+(m//2r)1+(m / 2 r)1+(m/2r). The quantity m ( cm ) m ( cm ) m(cm)m(\mathrm{~cm})m( cm) is uniquely determined by the condition that the geometry be asymtotically flat. It measures the mass-energy of the distribution of gravitational radiation.
Wave amplitude determines mass-energy: m = m ( A ) m = m ( A ) m=m(A)m=m(A)m=m(A)
"Time-antisymmetric" initial-value data
The mass m m mmm of the gravitational radiation is proportional to A 2 A 2 A^(2)A^{2}A2 for small values of the amplitude A A AAA. It is inversely proportional to the reduced wavelength λ = λ = lambda=\lambda=λ= (effective wavelength / 2 π / 2 π //2pi/ 2 \pi/2π ) that measures the scale of rapid variations in the gravitational wave amplitude q 1 ( ρ , z ) q 1 ( ρ , z ) q_(1)(rho,z)q_{1}(\rho, z)q1(ρ,z) in the "active zone." Thus the metric is dominated by wiggles, proportional in amplitude to A A AAA, in the active zone, and at larger distances dominated by something close to a Schwarzschild ( 1 + 2 m / r ) ( 1 + 2 m / r ) (1+2m//r)(1+2 m / r)(1+2m/r) factor in the metric. When the amplitude A A AAA is increased, a critical value is attained, A = A crit A = A crit  A=A_("crit ")A=A_{\text {crit }}A=Acrit , at which m m mmm goes to infinity and the geometry curves up into closure ("universe closed by its own content of gravitational-wave energy"). Further analysis and examples will be found in Wheeler (1964a), pp. 399-451, also in Wheeler (1964c).
Brill has carried out a similar analysis [Brill (1961)] for the vacuum case of what he calls time-antisymmetric initial-value conditions, sketched below as amended by York (1973). (1) The initial slice is maximal, Tr K = 0 Tr K = 0 Tr K=0\operatorname{Tr} \boldsymbol{K}=0TrK=0. (2) This slice is conformally flat,
(21.139) g i j = ψ 4 δ i j (21.139) g i j = ψ 4 δ i j {:(21.139)g_(ij)=psi^(4)delta_(ij):}\begin{equation*} g_{i j}=\psi^{4} \delta_{i j} \tag{21.139} \end{equation*}(21.139)gij=ψ4δij
(3) Work in the "base space" with metric δ i j δ i j delta_(ij)\delta_{i j}δij and afterwards transform to the geometry (21.139). Three of the initial-value equations become
(21.140) K base , j i j = 0 . (21.140) K base  , j i j = 0 . {:(21.140)K_("base ",j)^(ij)=0.:}\begin{equation*} K_{\text {base }, j}^{i j}=0 . \tag{21.140} \end{equation*}(21.140)Kbase ,jij=0.
To solve these equations, (1) take any localized trace-free symmetric tensor B k m B k m B_(km)B_{k m}Bkm; (2) solve the flat-space Laplace equation 2 A = ( 3 / 2 ) 2 B k m / x k x m 2 A = ( 3 / 2 ) 2 B k m / x k x m grad^(2)A=(3//2)del^(2)B_(km)//delx^(k)delx^(m)\nabla^{2} A=(3 / 2) \partial^{2} B_{k m} / \partial x^{k} \partial x^{m}2A=(3/2)2Bkm/xkxm for A A AAA;
(3) define the six potentials A k m = B k m + 1 3 A δ k m A k m = B k m + 1 3 A δ k m A_(km)=B_(km)+(1)/(3)Adelta_(km)A_{k m}=B_{k m}+\frac{1}{3} A \delta_{k m}Akm=Bkm+13Aδkm; and (4) calculate
(21.141) K base i j = [ i k ] [ j m n ] 2 A k m / x x n , (21.141) K base  i j = [ i k ] [ j m n ] 2 A k m / x x n , {:(21.141)K_("base ")^(ij)=[ikℓ][jmn]del^(2)A_(km)//delx^(ℓ)delx^(n)",":}\begin{equation*} K_{\text {base }}^{i j}=[i k \ell][j m n] \partial^{2} A_{k m} / \partial x^{\ell} \partial x^{n}, \tag{21.141} \end{equation*}(21.141)Kbase ij=[ik][jmn]2Akm/xxn,
that automatically satisfy (21.140) and give Tr K base = 0 Tr K base  = 0 TrK_("base ")=0\operatorname{Tr} \boldsymbol{K}_{\text {base }}=0TrKbase =0. Then K i j = ψ 10 K base i j K i j = ψ 10 K base  i j K^(ij)=psi^(-10)K_("base ")^(ij)K^{i j}=\psi^{-10} K_{\text {base }}^{i j}Kij=ψ10Kbase ij also automatically satisfies these conditions, but now in the curved geometry (21.139). The final initial-value equation becomes a quasilinear elliptic equation, in the flat base space, for the conformal factor ψ ψ psi\psiψ,
(21.142) 8 base 2 ψ + ψ 7 i , j ( K base i j ) 2 = 0 . (21.142) 8 base  2 ψ + ψ 7 i , j K base  i j 2 = 0 . {:(21.142)8grad_("base ")^(2)psi+psi^(-7)sum_(i,j)(K_("base "ij))^(2)=0.:}\begin{equation*} 8 \nabla_{\text {base }}^{2} \psi+\psi^{-7} \sum_{i, j}\left(K_{\text {base } i j}\right)^{2}=0 . \tag{21.142} \end{equation*}(21.142)8base 2ψ+ψ7i,j(Kbase ij)2=0.
The asymptotic form of ψ ψ psi\psiψ reveals that the mass of the wave is positive.
In addition to the time-symmetric and time-antisymmetric cases, there are at least two further cases where the initial-value problem possess special simplicity. One is the case of a geometry endowed with a symmetry, as, for example, for the Friedmann universe of Chapter 27 or the mixmaster universe of Chapter 30 or cylindrical gravitational waves in the treatment of Kuchař (1971a). One starts with a spacelike slice on which the g i j g i j g_(ij)g_{i j}gij and π i j π i j pi^(ij)\pi^{i j}πij have a special symmetry, and makes all future spacelike slices in a way that preserves this symmetry. The geometry on any one of these simultaneities, though almost entirely governed by these symmetry considerations, still typically demands some countable number of parameters for its complete determination, such as the radius of the Friedmann universe, or the three principal radii of curvature of the mixmaster universe. These parameters and the momenta conjugate to them define a miniphase space. In this miniphase space, the dynamics runs its course as for any other problem of classical dynamics [see, for example, Box 30.1 and Misner (1969) for the mixmaster universe; Kuchař (1971a) and (1972) for waves endowed with cylindrical symmetry; and Gowdy (1973) for waves with spherical symmetry]. Even the evidence for the existence of many-fingered time, most characteristic feature of general relativity, is suppressed as the price for never having to give attention to any spacelike slice that departs from the prescribed symmetry.
Finite dimensional dynamics for geometries endowed with high symmetry

Exercise 21.16. POOR MAN'S WAY TO DO COSMOLOGY

Consider a spacetime with the metric
d s 2 = d t 2 + a 2 ( t ) [ d χ 2 + sin 2 χ ( d θ 2 + sin 2 θ d ϕ ˙ 2 ) ] d s 2 = d t 2 + a 2 ( t ) d χ 2 + sin 2 χ d θ 2 + sin 2 θ d ϕ ˙ 2 ds^(2)=-dt^(2)+a^(2)(t)[dchi^(2)+sin^(2)chi(dtheta^(2)+sin^(2)theta dphi^(˙)^(2))]d s^{2}=-d t^{2}+a^{2}(t)\left[d \chi^{2}+\sin ^{2} \chi\left(d \theta^{2}+\sin ^{2} \theta d \dot{\phi}^{2}\right)\right]ds2=dt2+a2(t)[dχ2+sin2χ(dθ2+sin2θdϕ˙2)]
corresponding to a 3-geometry with the form of a sphere of radius a ( t ) a ( t ) a(t)a(t)a(t) changing with time. Show that the tensor of extrinsic curvature as expressed in a local Euclidean frame of reference is
K = a 1 ( d a / d t ) 1 K = a 1 ( d a / d t ) 1 K=-a^(-1)(da//dt)1\boldsymbol{K}=-a^{-1}(d a / d t) \mathbf{1}K=a1(da/dt)1
where 1 is the unit tensor. Show that the initial value equation (21.77) reduces to
( 6 / a 2 ) ( d a / d t ) 2 + ( 6 / a 2 ) = 16 π ρ ( a ) 6 / a 2 ( d a / d t ) 2 + 6 / a 2 = 16 π ρ ( a ) (6//a^(2))(da//dt)^(2)+(6//a^(2))=16 pi rho(a)\left(6 / a^{2}\right)(d a / d t)^{2}+\left(6 / a^{2}\right)=16 \pi \rho(a)(6/a2)(da/dt)2+(6/a2)=16πρ(a)
[for the value of the second term on the left, see exercise 14.3 and Boxes 14.2 and 14.5], and explain why it is appropriate to write the term on the right as 6 a 0 / a 3 6 a 0 / a 3 6a_(0)//a^(3)6 a_{0} / a^{3}6a0/a3 for a "dust-filled model universe." More generally, given any equation of state, p = p ( ρ ) p = p ( ρ ) p=p(rho)p=p(\rho)p=p(ρ), explain how one can find ρ = ρ ( a ) ρ = ρ ( a ) rho=rho(a)\rho=\rho(a)ρ=ρ(a) from
d ( ρ a 3 ) = p d ( a 3 ) d ρ a 3 = p d a 3 d(rhoa^(3))=-pd(a^(3))d\left(\rho a^{3}\right)=-p d\left(a^{3}\right)d(ρa3)=pd(a3)
and how one can thus forecast the history of expansion and recontraction, a = a ( t ) a = a ( t ) a=a(t)a=a(t)a=a(t).

Exercise 21.17. THIN-SANDWICH VARIATIONAL PRINCIPLE FOR the scalar potential in electrodynamics

(a) Choose the unknown U m U m U^(m)U^{m}Um in the expression
1 8 π g m n ϕ x m ϕ x n + U m ϕ x n 1 8 π g m n ϕ x m ϕ x n + U m ϕ x n (1)/(8pi)g^(mn)(del phi)/(delx^(m))(del phi)/(delx^(n))+U^(m)(del phi)/(delx^(n))\frac{1}{8 \pi} g^{m n} \frac{\partial \phi}{\partial x^{m}} \frac{\partial \phi}{\partial x^{n}}+U^{m} \frac{\partial \phi}{\partial x^{n}}18πgmnϕxmϕxn+Umϕxn
in such a way that this expression, multiplied by the volume element g 1 / 2 d 3 x g 1 / 2 d 3 x g^(1//2)d^(3)xg^{1 / 2} d^{3} xg1/2d3x, and integrated over the simultaneity Σ Σ Sigma\SigmaΣ, is extremized by a ϕ ϕ phi\phiϕ, and only by a ϕ ϕ phi\phiϕ, that satisfies the initial-value equation (21.108) of electrodynamics.
(b) Show that the resulting variational principle, instead of having to be invented "out of the blue," is none other than what follows directly from the action principle build on the Lagrangian density (21.100) of electrodynamics (independent variation of ϕ ϕ phi\phiϕ and the three A i A i A_(i)A_{i}Ai everywhere between the two faces of a sandwich to extremize I I III, subject only to the prior specification of the A i A i A_(i)A_{i}Ai on the two faces of the sandwich, in the limit where the thickness of the sandwich goes to zero).
Exercise 21.18. THIN-SANDWICH VARIATIONAL PRINCIPLE FOR THE LAPSE AND SHIFT FUNCTIONS IN GEOMETRODYNAMICS
(a) Extremize the action integral
I 3 = { [ R ( Tr K ) 2 + Tr K 2 2 T n n ] N 2 T n k N k } g 1 / 2 d 3 x I 3 = R ( Tr K ) 2 + Tr K 2 2 T n n N 2 T n k N k g 1 / 2 d 3 x {:[I_(3)= int{[R-(Tr K)^(2)+TrK^(2)-2T_(nn)^(**)]N:}],[{:-2T_(n)^(**k)N_(k)}g^(1//2)d^(3)x]:}\begin{aligned} I_{3}= & \int\left\{\left[R-(\operatorname{Tr} \boldsymbol{K})^{2}+\operatorname{Tr} \boldsymbol{K}^{2}-2 T_{n n}^{*}\right] N\right. \\ & \left.-2 T_{n}^{* k} N_{k}\right\} g^{1 / 2} d^{3} x \end{aligned}I3={[R(TrK)2+TrK22Tnn]N2TnkNk}g1/2d3x
with respect to the lapse and shift functions, and show that one arrives in this way at the four initial-value equations of geometrodynamics. It is understood that one has given the six g i j six g i j sixg_(ij)\operatorname{six} g_{i j}sixgij and the six g i j / t six g i j / t six delg_(ij)//del t\operatorname{six} \partial g_{i j} / \partial tsixgij/t on the simultaneity where the analysis is being done. The extrinsic curvature is considered to be expressed as in (21.67) in terms of these quantities and the lapse and shift. The energy density and energy flow are referred to a unit normal vector n n n\boldsymbol{n}n and three arbitrary coordinate basis vectors e i e i e_(i)\boldsymbol{e}_{\boldsymbol{i}}ei within the simultaneity, as earlier in this chapter, and the asterisk is an abbreviation for an omitted factor of 8 π 8 π 8pi8 \pi8π.
(b) Derive this variational principle from the ADM variational principle by going to the limit of an infinitesimally thin sandwich [see derivation in Wheeler (1964)].

Exercise 21.19. CONDENSED THIN-SANDWICH VARIATIONAL PRINCIPLE

(a) Extremize the action I 3 I 3 I_(3)I_{3}I3 of the preceding exercise with respect to the lapse function N N NNN.
(b) What is the relation between the result and the principle that "3-geometry is a carrier of information about time"?
(c) By elimination of N N NNN, arrive at a "condensed thin-sandwich variational principle" in which the only quantities to be varied are the three shift functions N i N i N_(i)N_{i}Ni.

Exercise 21.20. POOR MAN'S WAY TO SCHWARZSCHILD GEOMETRY

On curved empty space evolving deterministically in time, impose the conditions (1) that it possess a moment of time-symmetry, a spacelike hypersurface, the extrinsic curvature of which, with respect to the enveloping spacetime, is everywhere zero, and (2) that this spacelike hypersurface be endowed with spherical symmetry. Write the metric of the 3-geometry in the form
d s 2 = ψ 4 ( r ¯ ) ( d r ¯ 2 + r ¯ 2 d θ 2 + r ¯ 2 sin 2 θ d ϕ 2 ) d s 2 = ψ 4 ( r ¯ ) d r ¯ 2 + r ¯ 2 d θ 2 + r ¯ 2 sin 2 θ d ϕ 2 ds^(2)=psi^(4)( bar(r))(d bar(r)^(2)+ bar(r)^(2)dtheta^(2)+ bar(r)^(2)sin^(2)theta dphi^(2))d s^{2}=\psi^{4}(\bar{r})\left(d \bar{r}^{2}+\bar{r}^{2} d \theta^{2}+\bar{r}^{2} \sin ^{2} \theta d \phi^{2}\right)ds2=ψ4(r¯)(dr¯2+r¯2dθ2+r¯2sin2θdϕ2)
From the initial-value equation (21.127), show that the conformal factor ψ ψ psi\psiψ up to a multiplicative factor must have the form ψ = ( 1 + m / 2 r ¯ ) ψ = ( 1 + m / 2 r ¯ ) psi=(1+m//2 bar(r))\psi=(1+m / 2 \bar{r})ψ=(1+m/2r¯). Show that the proper circumference 2 π r ¯ ψ 2 ( r ¯ ) 2 π r ¯ ψ 2 ( r ¯ ) 2pi bar(r)psi^(2)( bar(r))2 \pi \bar{r} \psi^{2}(\bar{r})2πr¯ψ2(r¯) assumes a minimum value at a certain value of r ¯ r ¯ bar(r)\bar{r}r¯, thus defining the throat of the 3-geometry. Show that the 3-geometry is mirror-symmetric with respect to reflection in this throat in the sense that the metric is unchanged in form under the substitution r = m 2 / 4 r ¯ r = m 2 / 4 r ¯ r^(')=m^(2)//4 bar(r)r^{\prime}=m^{2} / 4 \bar{r}r=m2/4r¯. Find the transformation from the conformal coordinate r ¯ r ¯ bar(r)\bar{r}r¯ to the Schwarzschild coordinate r r rrr.

§21.11. YORK'S "HANDLES" TO SPECIFY A 4-GEOMETRY

On a simultaneity-or on the simultaneity-of extremal proper volume, give the conformal part of the 3-geometry and give the two inequivalent components of the dynamically conjugate momentum in order (1) to have freely specifiable, but also complete, initial-value data and thus (2) to determine completely the whole generic four-dimensional spacetime manifold. This in brief is York's extension (1971, 1972b) to the generic case of what Brill did for special cases (see the preceding section). York and Brill acknowledge earlier considerations of Lichnerowicz (1944) and Bruhat (1962 and earlier papers cited there on conformal geometry and the initial-value problem). But why conformal geometry, and why pick such a special spacelike hypersurface on which to give the four dynamic data per spacepoint?
Few solutions of Maxwell's equations are simpler than an infinite plane monochromatic wave in Minkowski's flat spacetime, and few look more complex when examined on a spacelike slice cut through that spacetime in an arbitrary way, with local wiggles and waves, larger-scale lumps and bumps, and still larger-scale general curvatures. No one who wants to explore electrodynamics in its evolution with many-fingered time can avoid these complexities; and no one will accept these complexities of many-fingered time who wants to see the degrees of freedom of the electromagnetic field in and by themselves exhibited in their neatest form. He will pick the simplest kind of timelike slice he can find. On that simultaneity, there are two and only two field coordinates, and two and only two field momenta per spacepoint. Similarly in geometrodynamics.
When one wants to untangle the degrees of freedom of the geometry, as distinct from analyzing the dynamics of the geometry, one therefore retreats from the three items of information per spacepoint that are contained in a 3-geometry [or in any other way of analyzing the geometrodynamics, as especially seen in the "extrinsic time" formulation of Kuchař (1971b and 1972)] and following York (1) picks the simultaneity to have maximal proper volume and (2) on this simultaneity specifies the two "coordinate degrees of freedom per spacepoint" that are contained in the conformal part of the 3-geometry.
An element of proper volume g 1 / 2 d 3 x g 1 / 2 d 3 x g^(1//2)d^(3)xg^{1 / 2} d^{3} xg1/2d3x on the spacelike hypersurface Σ Σ Sigma\SigmaΣ undergoes, in the next unit interval of proper time as measured normal to the hypersurface, a fractional increase of proper volume [see Figure 21.3 and equations 21.59 and 21.66] given by
(21.143) Tr K = 1 2 g 1 / 2 Tr n (21.143) Tr K = 1 2 g 1 / 2 Tr n {:(21.143)-Tr K=-(1)/(2)g^(-1//2)Tr n:}\begin{equation*} -\operatorname{Tr} \boldsymbol{K}=-\frac{1}{2} g^{-1 / 2} \operatorname{Tr} \boldsymbol{n} \tag{21.143} \end{equation*}(21.143)TrK=12g1/2Trn
For the volume to be extremal this quantity must vanish at every point of Σ Σ Sigma\SigmaΣ. This condition is satisfied in a Friedmann universe (Chapter 27) and in a Taub universe (Chapter 30) at that value of the natural time-coordinate t t ttt at which the universe switches over from expansion to recontraction. It is remarkable that the same condition on the choice of simultaneity, Σ Σ Sigma\SigmaΣ, lets itself be formulated in the same natural way,
(21.144) Tr K = 0 or Tr π = 0 (21.144) Tr K = 0  or  Tr π = 0 {:(21.144)Tr K=0" or "Tr pi=0:}\begin{equation*} \operatorname{Tr} \boldsymbol{K}=0 \text { or } \operatorname{Tr} \boldsymbol{\pi}=0 \tag{21.144} \end{equation*}(21.144)TrK=0 or Trπ=0
The degrees of freedom of the geometry in brief
Pick hypersurface of extremal proper volume
Case of open 3-geometry
Meaning of conformal 3-geometry
for a closed universe altogether deprived of any symmetry whatsoever. Alternatively, one can deal with a spacetime that is topologically the product of an open 3 -space by the real line (time). Then it is natural to think of specifying the location in it of a bounding spacelike 2 -geometry S S SSS with the topology of a 2 -sphere. Then one has many ways to fill in the interior of S S SSS with a spacelike 3 -geometry Σ Σ Sigma\SigmaΣ; but of all these Σ Σ Sigma\SigmaΣ 's, only the one that is extremal, or only the ones that are extremal, satisfy (21.144).
Who is going to specify this 2 -geometry with the topology of a 2 -sphere? The choice of that 2-geometry is not a matter of indifference. In a given 4-geometry, distinct choices for the bounding 2-geometry will ordinarily give distinct results for the extremizing 3 -geometry, and therefore different choices for the "initial-value simultaneity," Σ Σ Sigma\SigmaΣ. No consideration immediately thrusts itself forward that would give preference to one choice of 2-geometry over another. However, no such infinity of options presents itself when one limits attention to a closed 3-geometry. Therefore it will give concreteness to the following analysis to consider it applied to a closed universe, even though the analysis surely lets itself be made well-defined in an open region by appropriate specification of boundary values on the closed 2-geometry that bounds that open region. In brief, by limiting attention to a closed 3 -geometry, one lets the obvious condition of closure take the place of boundary conditions that are not obvious.
York's analysis remains simple when his extrinsic time
τ = 2 3 g 1 / 2 Tr n = 4 3 Tr K τ = 2 3 g 1 / 2 Tr n = 4 3 Tr K tau=(2)/(3)g^(-1//2)Tr n=(4)/(3)Tr K\tau=\frac{2}{3} g^{-1 / 2} \operatorname{Tr} \boldsymbol{n}=\frac{4}{3} \operatorname{Tr} \boldsymbol{K}τ=23g1/2Trn=43TrK
has any constant value on the hypersurface, not only the value τ = 0 τ = 0 tau=0\tau=0τ=0 appropriate for the hypersurface of extremal proper volume.
On the simultaneity Σ Σ Sigma\SigmaΣ specified by the condition of constant extrinsic time, τ = τ = tau=\tau=τ= constant, begin by giving the conformal 3 -geometry,
(21.145) <= ( 3 ) <= ( the equivalence class of all those positive definite Riemannian three-dimensional metrics that are equivalent to each other under (1) diffeomorphism (smooth sliding of the points over the mainfold to new locations) or (2) changes of scale that vary smoothly from point to point, leaving fixed all local angles (ratios of local distances), but changing local distances themselves or (3) both. ) (21.145) <= ( 3 ) <=  the equivalence class of all those positive definite   Riemannian three-dimensional metrics that are   equivalent to each other under (1) diffeomorphism   (smooth sliding of the points over the mainfold to   new locations) or (2) changes of scale that vary   smoothly from point to point, leaving fixed all   local angles (ratios of local distances), but   changing local distances themselves or (3) both.  {:(21.145)<=^((3))<=([" the equivalence class of all those positive definite "],[" Riemannian three-dimensional metrics that are "],[" equivalent to each other under (1) diffeomorphism "],[" (smooth sliding of the points over the mainfold to "],[" new locations) or (2) changes of scale that vary "],[" smoothly from point to point, leaving fixed all "],[" local angles (ratios of local distances), but "],[" changing local distances themselves or (3) both. "]):}<={ }^{(3)}<=\left(\begin{array}{l} \text { the equivalence class of all those positive definite } \tag{21.145}\\ \text { Riemannian three-dimensional metrics that are } \\ \text { equivalent to each other under (1) diffeomorphism } \\ \text { (smooth sliding of the points over the mainfold to } \\ \text { new locations) or (2) changes of scale that vary } \\ \text { smoothly from point to point, leaving fixed all } \\ \text { local angles (ratios of local distances), but } \\ \text { changing local distances themselves or (3) both. } \end{array}\right)(21.145)<=(3)<=( the equivalence class of all those positive definite  Riemannian three-dimensional metrics that are  equivalent to each other under (1) diffeomorphism  (smooth sliding of the points over the mainfold to  new locations) or (2) changes of scale that vary  smoothly from point to point, leaving fixed all  local angles (ratios of local distances), but  changing local distances themselves or (3) both. )
The conformal 3-geometry is a geometric object that lends itself to definition and interpretation quite apart from the specific choice of coordinate system and even without need to use any coordinates at all. The conformal 3-geometry (on the hypersurface Σ Σ Sigma\SigmaΣ where τ = τ = tau=\tau=τ= constant) may be regarded much as one regards the magnetic field in electromagnetism. The case of conformally flat 3-geometry,
(21.146) d s 2 = ψ 4 ( x , y , z ) d s base 2 (21.146) d s 2 = ψ 4 ( x , y , z ) d s base  2 {:(21.146)ds^(2)=psi^(4)(x","y","z)ds_("base ")^(2):}\begin{equation*} d s^{2}=\psi^{4}(x, y, z) d s_{\text {base }}^{2} \tag{21.146} \end{equation*}(21.146)ds2=ψ4(x,y,z)dsbase 2
(with g i j base = δ i j g i j  base  = δ i j g_(ij" base ")=delta_(ij)g_{i j \text { base }}=\delta_{i j}gij base =δij ), is analogous to those initial-value situations in electromagnetism where the magnetic field is everywhere zero (the time-antisymmetric initial-value problem of Brill); but now we consider the case of general d s base 2 d s base  2 ds_("base ")^(2)d s_{\text {base }}^{2}dsbase 2.
The six metric coefficients g i j g i j g_(ij)g_{i j}gij of the conformal 3 -geometry, subject to being changed by change of the three coordinates x i x i x^(i)x^{i}xi, and undetermined at any one point up to a common position-dependent multiplicative factor, carry 6 3 1 = 2 6 3 1 = 2 6-3-1=26-3-1=2631=2 pieces of information per spacepoint. In this respect, they are like the components of the divergenceless magnetic field B B B\mathscr{B}B. The corresponding field momentum π E M i G i π E M i G i pi_(EM)^(i)propG^(i)\pi_{E M}^{i} \propto \mathcal{G}^{i}πEMiGi (Box 21.1, page 496) has its divergence specified by the charge density, and so also carries
two pieces of information (in addition to the prescribed information about the density of charge) per spacepoint.
The comparison is a little faulty between the components of B B B\mathscr{B}B and the metric coefficients. They are more like potentials than like components of the physically relevant field.
The appropriate measure of the "field" in geometrodynamics is the curvature tensor; but how can one possibly define a curvature tensor for a geometry that is as rudimentary as a conformal 3-geometry? York (1971) has raised and answered this question. The Weyl conformal-curvature tensor [equation (13.50) and exercise 13.13] is independent [in the proper ( 2 2 ) ( 2 2 ) ((2)/(2))\binom{2}{2}(22) representation], in spaces of higher dimensionality, of the position-dependent factor ψ 4 ψ 4 psi^(4)\psi^{4}ψ4 with which one multiplies the metric coefficients, but vanishes identically in three-dimensional space (exercise 21.21). One arrives at a non-zero conformally invariant measure of the curvature only when one goes to one higher derivative (exercise 21.22). In this way, one comes to York's curvature β ~ a b β ~ a b widetilde(beta)^(ab)\widetilde{\beta}^{a b}β~ab, here called Y a b Y a b Y^(ab)Y^{a b}Yab, a tensor density with these properties:
Y a b = Y b a ( symmetric ) ; Y a a = 0 (traceless); Y a b b = 0 (transverse); Y a b = Y b a (  symmetric  ) ; Y a a = 0  (traceless);  Y a b b = 0  (transverse);  {:[Y^(ab)=Y^(ba)(" symmetric ");],[Y_(a)^(a)=0" (traceless); "],[Y^(ab)_(∣b)=0" (transverse); "]:}\begin{gathered} Y^{a b}=Y^{b a}(\text { symmetric }) ; \\ Y_{a}^{a}=0 \text { (traceless); } \\ Y^{a b}{ }_{\mid b}=0 \text { (transverse); } \end{gathered}Yab=Yba( symmetric );Yaa=0 (traceless); Yabb=0 (transverse); 
Y a b Y a b Y^(ab)Y^{a b}Yab invariant with respect to position-dependent changes in the conformal scale factor;
Y a b = 0 Y a b = 0 Y^(ab)=0Y^{a b}=0Yab=0 when and only when the 3-geometry is conformally flat. (21.148)
Y a b Y a b Y^(ab)Y^{a b}Yab provides what York calls the pure spin-two representation of the 3-geometry intrinsic to Σ Σ Sigma\SigmaΣ. It is the analog of the field B B B\mathscr{B}B of electrodynamics on the spacelike initial-value simultaneity. It directly carries physical information about the conformal 3-geometry.
In addition to the conformal geometry ( 3 ) < ( 3 ) < ^((3)) <{ }^{(3)}<(3)<, specified by the "potentials" g i j / g 1 / 3 g i j / g 1 / 3 g_(ij)//g^(1//3)g_{i j} / g^{1 / 3}gij/g1/3, and measured by the "field components" Y i j Y i j Y^(ij)Y^{i j}Yij, one must also specify on Σ Σ Sigma\SigmaΣ the corresponding conjugate momenta:
The associated momenta
Unique solution for conformal factor
One seeks a solution ψ ψ psi\psiψ that is continuous over the closed manifold and everywhere real and positive. When does such a solution ψ ψ psi\psiψ of the elliptic equation (21.152) exist? When is it unique? Always (when M > 0 M > 0 M > 0M>0M>0 and τ 0 τ 0 tau!=0\tau \neq 0τ0 ), is the result of O'Murchadha and York (1973); see also earlier investigations of Choquet-Bruhat (1972). Some of the physical considerations that come into this kind of problem have been discussed by Wheeler (1964a, pp. 370-381).

§21.12. MACH'S PRINCIPLE AND THE ORIGIN OF INERTIA

In my opinion the general theory of relativity can only solve this problem [of inertia] satisfactorily if it regards the world as spatially self-enclosed.
On June 25, 1913, two years before he had discovered the geometrodynamic law that bears his name, Einstein (1913b) wrote to Ernst Mach (Figure 21.5) to express his appreciation for the inspiration that he had derived for his endeavors from Mach's ideas. In his great book, The Science of Mechanics, Mach [(1912), Chapter 2, section 6] had reasoned that it could not make sense to speak of the acceleration of a mass relative to absolute space. Anyone trying to clear physics of mystical ideas would do better, he reasoned, to speak of acceleration relative to the distant stars. But how can a star at a distance of 10 9 10 9 10^(9)10^{9}109 light-years contribute to inertia in the here and the now? To make a long story short, one can say at once that Einstein's theory (1) identifies gravitation as the mechanism by which matter there influences inertia here; (2) says that this coupling takes place on a spacelike hypersurface [in what one, without a closer examination, might mistakenly think to be a violation of the principle of causality; see Fermi (1932) for a discussion and clarification of the similar apparent paradox in electrodynamics; see also Einstein (1934), p. 84: "Moreover I believed that I could show on general considerations a law of gravitation invariant in relation to any transformation of coordinates whatever was inconsistent with the principle of causation. These were errors of thought which cost me two years of excessively hard work, until I finally recognized them as such at the end of 1915"]; (3) supplies in the initial-value equations of geometrodynamics a mathematical tool to describe this coupling; (4) demands closure of the geometry in space [one conjectures; see Wheeler ( 1959 , 1964 ( 1959 , 1964 (1959,1964(1959,1964(1959,1964 c) and Hönl ( 1962 ) ] ( 1962 ) ] (1962)](1962)](1962)], as a boundary condition on the initial-value equations if they are to yield a well-determined [and, we know now, a unique] 4-geometry; and (5) identifies the collection of local Lorentz frames near any point in this resulting spacetime as what one means quantitatively by speaking of inertia at that point. This is how one ends up with inertia here determined by density and flow of mass-energy there.
There are many scores of papers in the literature on Mach's principle, including many-even one by Lenin (English translation, 1927)-one could call anti-Machian; and many of them make interesting points [see especially the delightful dialog by Weyl (1924a) on "inertia and the cosmos," and the article (1957) and book (1961) of Sciama]. However, most of them were written before one had anything like the understanding of the initial-value problem that one possesses today. Therefore no
No violation of causality, despite appearances
An enormous literature
Figure 21.5.
Einstein's appreciation of Mach, written to Ernst Mach June 25, 1913, while Einstein was working hard at arriving at the final November 1915 formulation of standard general relativity. Regarding confirmation at a forthcoming eclipse: "If so, then your happy investigations on the foundations of mechanics, Planck's unjustified criticism notwithstanding, will receive brilliant confirmation. For it necessarily turns out that inertia originates in a kind of interaction between bodies, quite in the sense of your considerations on Newton's pail experiment. The first consequence is on p. 6 of my paper. The following additional points emerge: (1) If one accelerates a heavy shell of matter S S SSS, then a mass enclosed by that shell experiences an accelerative force. (2) If one rotates the shell relative to the fixed stars about an axis going through its center, a Coriolis force arises in the interior of the shell; that is, the plane of a Foucault pendulum is dragged around (with a practically unmeasurable small angular velocity)." Following the death of Mach, Einstein (1916a) wrote a tribute to the man and his work. Reprinted with the kind permission of the estate of Albert Einstein, Helen Dukas and Otto Nathan, executors.

Zines. 25. <compat>ᄑ 13.

Tosh pechentens Kolleye.'
Dues Lays hater tie moll serve nerve trabelniber Tilatori- taut med Grarstat-on uhalten;
che rash mocendlicher thither and quaitendern Zrvefel mon endlich farting gemorden it. Naichstes Yalu
beider Toumentimeteren beider Tounernfousterms aol exch geiger, ob dee LLcletstivhlen win der Tome gehreimuat aucrden, of m. a. W. Die gugrunde geley fondserventale trualiner wore der thequavaleny vire Bad. lemming Telmerifeld ousdenereests anklede patiofft.
wens j a j a jaj aja, os erfolveren Thu e geniale. Hetorsuchmegen sike due Gumedayen. fertigter Kratick gum liorty-enie
glaingenede bestaitignng. Demn es ergalt moch madt Notwonding keatt, dass dre Trigheit en edencs ot it Wechachoistrenng des toisper shem Urspuning hate, gany mon Stione, Theer Xibeleyrugzen gmin Nevotan' schen cfinen-Vercuch.
Line unte koncequeng in Nerescen Simen finden the oben anf tentle 6 der Aibeit is, hat soch furner folgendes ergelen:
  1. Beschlemnigt mans eve traje Kugelochale I, ar enfichit rach den Theorvie edm an hern
    ringeaellossewer Kinper exin theachen. nyyude tiraft
  2. Rot-ent whe tchale I hun eine clnerah hrea. Mittelpumkt gehanele Aahise ( uelation pom Syotem den Foxeternelseer system"), so entotefot 1 m 1 m 1m1 m1m In...enn der Iolvab esm Cornalis - Telal.


    I I III 'ko net inn esue grosae Frande, Thuen dies menter len zu kirmen, pancel pene Kuthk Dlandates sans schon sumen hithat angerecht furtigt esscherencen sree.
Mit gioseten Wochachtung gnteat the herglich Ethen ergebenet it. Howctem.
Dis duntre clumen hughen fore are thersendming Ohres Buches
Mach's principle updated and spelled out
attempt will be made to summarize or analyze the literature, which would demand a book in itself. Moreover, Mach's principle as presented here is more sharply formulated than Einstein ever put it in the literature [except for his considerations arguing that the universe must be closed; see Einstein's book (1950), pp. 107-108]; and Mach would surely have disowned it, for he could never bring himself to accept general (or even special) relativity. Nevertheless, it is a fact that Mach's principlethat matter there governs inertia here-and Riemann's idea-that the geometry of space responds to physics and participates in physics-were the two great currents of thought which Einstein, by means of his powerful equivalence principle, brought together into the present-day geometric description of gravitation and motion.
"Specify everywhere the distribution and flow of mass-energy and thereby determine the inertial properties of every test particle everywhere and at all times". Spelled out, this prescription demands (1) a way of speaking about "everywhere": a spacelike hypersurface Σ Σ Sigma\SigmaΣ. Let one insist-in conformity with Einstein-(2) that it be a closed 3-geometry, and for convenience, not out of necessity, (3) that τ τ tau\tauτ be independent of position on Σ Σ Sigma\SigmaΣ. (4) Specify this 3-geometry to the extent of giving the conformal metric; without the specification of at least this much 3-geometry, there would be no evident way to say "where" the mass-energy is to be located. (5) Give density ρ base ρ base  rho_("base ")\rho_{\text {base }}ρbase  as a function of position in this conformal 3-geometry. (6) Recognize that giving the mass-energy only of fields other than gravity is an inadequate way to specify the distribution of mass-energy throughout space. Formalistically, to be sure, the gravitational fields does not and cannot make any contribution to the source term that stands on the righthand side of Einstein's field equation. However, the analysis of gravitational waves (Chapters 18 and 35) shows that perturbations in the geometry of scale small compared to the scale of observation have to be regarded as carrying an effective content of mass-energy. Moreover, one has in a geon [Wheeler (1955); Brill and Hartle (1964); for more on gravitational-wave energy, see § 35.14 ] § 35.14 ] §35.14]\S 35.14]§35.14] an object built out of gravitational waves (or electromagnetic waves, or neutrinos, or any combination of the three) that holds itself together for a time that is long in comparison to the characteristic period of vibration of the waves. It looks from a distance like any other mass, even though nowhere in its interior can one put a finger and say "here is mass." Therefore it, like any other mass, must have "its influence on inertia." But to specify this mass, one must give enough information to characterize completely the gravitational waves on the simultaneity Σ Σ Sigma\SigmaΣ. For this, it is not enough merely to have given the two "wave-coordinates" per spacepoint that one possesses in ( 3 ) < ( 3 ) < ^((3)) <{ }^{(3)}<(3)<. One must give in addition (7) the two "wave-momenta" per spacepoint that appear in York's "momentum density of weight 5 / 3 , " π ~ a b 5 / 3 , " π ~ a b 5//3," widetilde(pi)^(ab)5 / 3, " \widetilde{\pi}^{a b}5/3,"π~ab; and at the same time, as an inextricable part of this operation, one must (8) specify the density of flow of field energy. (9) Solve for the conformal factor ψ ψ psi\psiψ. (10) Then one has complete initial-value data that satisfy the initial-value equations of general relativity. (11) These data now known, the remaining, dynamic, components of the field equation determine the 4 -geometry into the past and the future. (12) In this way, the inertial properties of every test particle are determined everywhere and at all times, giving concrete realization to Mach's principle.
Much must still be done to spell out the physics behind these equations and to
see this physics in action. Some significant progress had already been made in this direction before the present stage in one's understanding of the initial-value equations. Especially interesting are results of Thirring (1918) and (1921) and of Thirring and Lense (1918), discussed by Einstein (1950) in the third edition of his book, The Meaning of Relativity.
Consider a bit of solid ground near the geographic pole, and a support erected there, and from it hanging a pendulum. Though the sky is cloudy, the observer watches the track of the Foucault pendulum as it slowly turns through 360 360 360^(@)360^{\circ}360. Then the sky clears and, miracle of miracles, the pendulum is found to be swinging all the time on an arc fixed relative to the far-away stars. If "mass there governs inertia here," as envisaged by Mach, how can this be?
Enlarge the question. By the democratic principle that equal masses are created equal, the mass of the earth must come into the bookkeeping of the Foucault pendulum. Its plane of rotation must be dragged around with a slight angular velocity, ω drag ω drag  omega_("drag ")\omega_{\text {drag }}ωdrag , relative to the so-called "fixed stars." How much is ω drag ω drag  omega_("drag ")\omega_{\text {drag }}ωdrag  ? And how much would ω drag ω drag  omega_("drag ")\omega_{\text {drag }}ωdrag  be if the pendulum were surrounded by a rapidly spinning spherical shell of mass M M MMM and radius R shell R shell  R_("shell ")R_{\text {shell }}Rshell , turning at angular velocity ω shell ω shell  omega_("shell ")\omega_{\text {shell }}ωshell  ?
Einstein's theory says that inertia is a manifestation of the geometry of spacetime. It also says that geometry is affected by the presence of matter to an extent proportional to the factor G / c 2 = 0.742 × 10 28 cm / g G / c 2 = 0.742 × 10 28 cm / g G//c^(2)=0.742 xx10^(-28)cm//gG / c^{2}=0.742 \times 10^{-28} \mathrm{~cm} / \mathrm{g}G/c2=0.742×1028 cm/g. Simple dimensional considerations leave no room except to say that the rate of drag is proportional to a expression of the form
(21.155) ω drag = k G c 2 m shell, conv R shell ω shell = k m shell R shell ω shell . (21.155) ω drag  = k G c 2 m shell, conv  R shell  ω shell  = k m shell  R shell  ω shell  {:(21.155)omega_("drag ")=k(G)/(c^(2))(m_("shell, conv "))/(R_("shell "))omega_("shell ")=k(m_("shell "))/(R_("shell "))omega_("shell ")". ":}\begin{equation*} \omega_{\text {drag }}=k \frac{G}{c^{2}} \frac{m_{\text {shell, conv }}}{R_{\text {shell }}} \omega_{\text {shell }}=k \frac{m_{\text {shell }}}{R_{\text {shell }}} \omega_{\text {shell }} \text {. } \tag{21.155} \end{equation*}(21.155)ωdrag =kGc2mshell, conv Rshell ωshell =kmshell Rshell ωshell 
Here k k kkk is a numerical factor to be found only by detailed calculation. Lense and Thirring [(1918) and (1921)], starting with a flat background spacetime manifold, calculated in the weak-field approximation of Chapter 18 the effect of the moving current of mass on the metric. Expressed in polar coordinates, the metric acquires a non-zero coefficient g ϕ t g ϕ t g_(phi t)g_{\phi t}gϕt. Inserted into the equation of geodesic motion, this offdiagonal metric coefficient gives rise to a precession. This precession (defined here about an axis parallel to the axis of rotation, not about the local vertical) is given by an expression of the form (21.155), where the precession factor k k kkk has the value
(21.156) k = 4 / 3 (21.156) k = 4 / 3 {:(21.156)k=4//3:}\begin{equation*} k=4 / 3 \tag{21.156} \end{equation*}(21.156)k=4/3
There is a close parallelism between the magnetic component of the Maxwell field and the precession component of the Einstein field. In neither field does a source at rest produce the new kind of effect when acting on a test particle that is also at rest. One designs a circular current of charge to produce a magnetic field; and a test charge, in order to respond to this magnetic field, must also be in motion. Similarly here: no pendulum vibration means no pendulum precession. Moreover, the direction of the precession depends on where the pendulum is, relative to the rotating shell of mass. The precession factor k k kkk has the following values:
(21.157) k = 4 / 3 for pendulum anywhere inside rotating shell of mass; k = 4 / 3 for pendulum at North or South pole; k = 2 / 3 for pendulum just outside the rotating shell at its equator. (21.157) k = 4 / 3  for pendulum anywhere inside rotating   shell of mass;  k = 4 / 3  for pendulum at North or South pole;  k = 2 / 3  for pendulum just outside the rotating   shell at its equator.  {:(21.157){:[k=4//3,{:[" for pendulum anywhere inside rotating "],[" shell of mass; "]:}],[k=4//3," for pendulum at North or South pole; "],[k=-2//3," for pendulum just outside the rotating "],[" shell at its equator. "]:}:}\begin{array}{ll} k=4 / 3 & \begin{array}{l} \text { for pendulum anywhere inside rotating } \\ \text { shell of mass; } \end{array} \\ k=4 / 3 & \text { for pendulum at North or South pole; } \\ k=-2 / 3 & \text { for pendulum just outside the rotating } \tag{21.157}\\ \text { shell at its equator. } \end{array}(21.157)k=4/3 for pendulum anywhere inside rotating  shell of mass; k=4/3 for pendulum at North or South pole; k=2/3 for pendulum just outside the rotating  shell at its equator. 
This position-dependence of the drag, ω drag ω drag  omega_("drag ")\omega_{\text {drag }}ωdrag , makes still more apparent the analogy with magnetism, where the field of a rotating charged sphere points North at the center of the sphere, and North at both poles, but South at the equator.
Whether the Foucault pendulum is located in imagination at the center of the earth or in actuality at the North pole, the order of magnitude of the expected drag is
(21.158) ω drag m earth R earth ω earth 0.44 cm 6 × 10 8 cm 1 radian 13700 sec 5 × 10 14 rad / sec (21.158) ω drag  m earth  R earth  ω earth  0.44 cm 6 × 10 8 cm 1  radian  13700 sec 5 × 10 14 rad / sec {:[(21.158)omega_("drag ")∼(m_("earth "))/(R_("earth "))omega_("earth ")∼(0.44(cm))/(6xx10^(8)(cm))(1" radian ")/(13700sec)],[∼5xx10^(-14)rad//sec]:}\begin{align*} \omega_{\text {drag }} & \sim \frac{m_{\text {earth }}}{R_{\text {earth }}} \omega_{\text {earth }} \sim \frac{0.44 \mathrm{~cm}}{6 \times 10^{8} \mathrm{~cm}} \frac{1 \text { radian }}{13700 \mathrm{sec}} \tag{21.158}\\ & \sim 5 \times 10^{-14} \mathrm{rad} / \mathrm{sec} \end{align*}(21.158)ωdrag mearth Rearth ωearth 0.44 cm6×108 cm1 radian 13700sec5×1014rad/sec
too small to allow detection, let alone actual measurement, by any device so far built-but perhaps measurable by gyroscopes now under construction (§40.7). By contrast, near a rapidly spinning neutron star or near a black hole endowed with substantial angular momentum, the calculated drag effect is not merely detectable; it is even important (see Chapter 33 on the physics of a rotating black hole).
The distant stars must influence the natural plane of vibration of the Foucault pendulum as the nearby rotating shell of matter does, provided that the stars are not so far away ( r r r∼r \simr radius of universe) that the curvature of space begins to introduce substantial corrections into the calculation of Thirring and Lense. In other words, no reason is apparent why all masses should not be treated on the same footing, so that (21.158) more appropriately, if also somewhat symbolically, reads
Moreover, when there is no nearby shell of matter, or when it has negligible effects, the plane of vibration of the pendulum, if experience is any guide, cannot turn with respect to the frame defined by the far-away "stars." In this event ω Foucault ω Foucault  omega_("Foucault ")\omega_{\text {Foucault }}ωFoucault  must be identical with ω stars ω stars  omega_("stars ")\omega_{\text {stars }}ωstars ; or the "sum for inertia,"
(21.160) far-away "stars" m "star" r "star" m universe r universe (21.160)  far-away   "stars"  m "star"  r "star"  m universe  r universe  {:(21.160)sum_({:[" far-away "],[" "stars" "]:})(m_(""star" "))/(r_(""star" "))∼(m_("universe "))/(r_("universe ")):}\begin{equation*} \sum_{\substack{\text { far-away } \\ \text { "stars" }}} \frac{m_{\text {"star" }}}{r_{\text {"star" }}} \sim \frac{m_{\text {universe }}}{r_{\text {universe }}} \tag{21.160} \end{equation*}(21.160) far-away  "stars" m"star" r"star" muniverse runiverse 
must be of the order of unity. Just such a relation of approximate identity between the mass content of the universe and its radius at the phase of maximum expansion is a characteristic feature of the Friedman model and other simple models of a closed universe (Chapters 27 and 30). In this respect, Einstein's theory of Mach's principle exhibits a satisfying degree of self-consistency.
At phases of the dynamics of the universe other than the stage of maximum expansion, r universe r universe  r_("universe ")r_{\text {universe }}runiverse  can become arbitrarily small compared to m universe m universe  m_("universe ")m_{\text {universe }}muniverse . Then the ratio (21.160) can depart by powers of ten from unity. Regardless of this circumstance, one has no option but to understand that the effective value of the "sum for inertia" is still unity after all corrections have been made for the dynamics of contraction or expansion, for retardation, etc. Only so can ω Foucault ω Foucault  omega_("Foucault ")\omega_{\text {Foucault }}ωFoucault  retain its inescapable identity with ω far-away stars. ω far-away stars.  omega_("far-away stars. ")\omega_{\text {far-away stars. }}ωfar-away stars. . Fortunately, one does not have to pursue the theology of the "sum for inertia" to the uttermost of these sophistications to have a proper account of inertia. Mach's idea that mass there determines inertia here has its complete mathematical account in Einstein's geometrodynamic law, as already spelled out. For the first strong-field analysis of the dragging of the inertial reference system in the context of relativistic cosmology, see Brill and Cohen (1966) and Cohen and Brill (1967); see also $ 33.4 $ 33.4 $33.4\$ 33.4$33.4 for dragging by a rotating black hole.
Still another clarification is required of what Mach's principle means and how it is used. The inertial properties of a test particle are perfectly well-determined when that particle is moving in ideal Minkowski space. "Point out, please," the anti-Machian critic says, "the masses that are responsible for this inertia." In answer, recall that Einstein's theory includes not only the geometrodynamic law, but also, in Einstein's view, the boundary condition that the universe be closed. Thus the section of spacetime that is flat is to be viewed, not as infinite, but as part of a closed universe. (For a two-dimensional analog, fill a rubber balloon with water and set it on a glass tabletop and look at it from underneath). The part of the universe that is curved acquires its curvature by reason of its actual content of mass-energy or-if animated only by gravitational waves-by reason of its effective content of mass-energy. This mass-energy, real or effective, is to be viewed as responsible for the inertial properties of the test particle that at first sight looked all alone in the universe.
It in no way changes the qualitative character of the result to turn attention to a model universe where the region of Minkowski flatness, and all the other linear dimensions of the universe, have been augmented tenfold ("ten times larger balloon; ten times larger face"). The curvature and density of the curved part of the model universe are down by a factor of 100 , the volume is up by a factor of 1,000 , the mass is up by a factor of 10 ; but the ratio of mass to radius, or the "sum for inertia" (the poor man's substitute for a complete initial-value calculation) is unchanged.
Einstein acknowledged a debt of parentage for his theory to Mach's principle (Figure 21.5). It is therefore only justice that Mach's principle should in return today owe its elucidation to Einstein's theory.

Exercise 21.21. WHY THE WEYL CONFORMAL CURVATURE TENSOR VANISHES

How many independent components does the Riemann curvature tensor have in threedimensional space? How many does the Ricci curvature tensor have? Show that the two tensors are related by the formula
Minkowski geometry as limit of a closed 3-geometry
R a b c d = δ b d R a c δ c d R a b + g a c R b d g a b R c d + 1 2 R ( δ c d g a b δ b d g a c ) R a b c d = δ b d R a c δ c d R a b + g a c R b d g a b R c d + 1 2 R δ c d g a b δ b d g a c {:[R_(abc)^(d)=delta_(b)^(d)R_(ac)-delta_(c)^(d)R_(ab)+g_(ac)R_(b)^(d)-g_(ab)R_(c)^(d)],[+(1)/(2)R(delta_(c)^(d)g_(ab)-delta_(b)^(d)g_(ac))]:}\begin{aligned} R_{a b c}^{d}= & \delta_{b}^{d} R_{a c}-\delta_{c}^{d} R_{a b}+g_{a c} R_{b}^{d}-g_{a b} R_{c}^{d} \\ & +\frac{1}{2} R\left(\delta_{c}^{d} g_{a b}-\delta_{b}^{d} g_{a c}\right) \end{aligned}Rabcd=δbdRacδcdRab+gacRbdgabRcd+12R(δcdgabδbdgac)
with no need of any Weyl conformal-curvature tensor to specify (as in higher dimensions) the further details of the Riemann tensor. Show that the Weyl tensor, from an n n nnn-dimensional modification of equation (13.50) as in exercise 13.13, vanishes for n = 2 n = 2 n=2n=2n=2.

Exercise 21.22. YORK'S CURVATURE

[York (1971)]. (a) Define the tensor [Eisenhart (1926)]
R a b c = R a b c R a c b + 1 4 ( g a c R b g a b R c ) . R a b c = R a b c R a c b + 1 4 g a c R b g a b R c . R_(abc)=R_(ab∣c)-R_(ac∣b)+(1)/(4)(g_(ac)R_(∣b)-g_(ab)R_(∣c)).R_{a b c}=R_{a b \mid c}-R_{a c \mid b}+\frac{1}{4}\left(g_{a c} R_{\mid b}-g_{a b} R_{\mid c}\right) .Rabc=RabcRacb+14(gacRbgabRc).
(b) Show that a 3-geometry is conformally flat when and only when R a b c = 0 R a b c = 0 R_(abc)=0R_{a b c}=0Rabc=0.
(c) Show that the following identities hold and reduce to five the number of independent components of R a b c R a b c R_(abc)R_{a b c}Rabc :
R a c a = g a b R b a c = 0 R a b c + R a c b = 0 ; R a b c + R c a b + R b c a = 0 . R a c a = g a b R b a c = 0 R a b c + R a c b = 0 ; R a b c + R c a b + R b c a = 0 . {:[R_(ac)^(a)=g^(ab)R_(bac)=0],[R_(abc)+R_(acb)=0;],[R_(abc)+R_(cab)+R_(bca)=0.]:}\begin{gathered} R_{a c}^{a}=g^{a b} R_{b a c}=0 \\ R_{a b c}+R_{a c b}=0 ; \\ R_{a b c}+R_{c a b}+R_{b c a}=0 . \end{gathered}Raca=gabRbac=0Rabc+Racb=0;Rabc+Rcab+Rbca=0.
(d) Show that Yorks' curvature
Y a b = g 1 / 3 [ a e f ] ( R f b 1 4 δ f b R ) l e = 1 2 g 1 / 3 [ a e f ] g b m R m e l Y a b = g 1 / 3 [ a e f ] R f b 1 4 δ f b R l e = 1 2 g 1 / 3 [ a e f ] g b m R m e l {:[Y^(ab)=g^(1//3)[aef](R_(f)^(b)-(1)/(4)delta_(f)^(b)R)_(le)],[=-(1)/(2)g^(1//3)[aef]g^(bm)R_(mel)]:}\begin{aligned} Y^{a b} & =g^{1 / 3}[a e f]\left(R_{f}^{b}-\frac{1}{4} \delta_{f}^{b} R\right)_{l e} \\ & =-\frac{1}{2} g^{1 / 3}[a e f] g^{b m} R_{m e l} \end{aligned}Yab=g1/3[aef](Rfb14δfbR)le=12g1/3[aef]gbmRmel
is conformally invariant and has the properties listed in equations (21.148).

Exercise 21.23. PULLING THE POYNTING FLUX VECTOR "OUT OF THE AIR"

From the condition that the Hamilton-Jacobi functional S ( g i j , A m ) S g i j , A m S(g_(ij),A_(m))S\left(g_{i j}, A_{m}\right)S(gij,Am) (extremal of the action integral) for the combined Einstein and Maxwell fields, ostensibly dependent on the six metric coefficients g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)(x,y,z)g_{i j}(x, y, z)gij(x,y,z) and the three potentials A m ( x , y , z ) A m ( x , y , z ) A_(m)(x,y,z)A_{m}(x, y, z)Am(x,y,z), shall actually depend only on the 3-geometry of the spacelike hypersurface and the distribution of magnetic field strength on this hypersurface, show that the geometrodynamic field momentum π i j = δ S / δ g i j π i j = δ S / δ g i j pi^(ij)=delta S//deltag_(ij)\pi^{i j}=\delta S / \delta g_{i j}πij=δS/δgij satisfies a condition of the form
π i j j = c [ i m n ] E m B n , π i j j = c [ i m n ] E m B n , pi^(ij)_(∣j)=c[imn]E_(m)B_(n),\pi^{i j}{ }_{\mid j}=c[i m n] \mathcal{E}_{m} \mathscr{B}_{n},πijj=c[imn]EmBn,
and evaluate the coefficient c c ccc in this equation [Wheeler (1968b)]. Hint: Note that the transformation
x i x i ξ i , g i j g i j + ξ i j + ξ j i x i x i ξ i , g i j g i j + ξ i j + ξ j i x^(i)longrightarrowx^(i)-xi^(i),g_(ij)longrightarrowg_(ij)+xi_(i∣j)+xi_(j∣i)x^{i} \longrightarrow x^{i}-\xi^{i}, g_{i j} \longrightarrow g_{i j}+\xi_{i \mid j}+\xi_{j \mid i}xixiξi,gijgij+ξij+ξji
in no way changes the 3-geometry itself, and therefore the corresponding induced change in S S SSS,
δ S = [ δ S δ g i j δ g i j + δ S δ A m δ A m ] d 3 x δ S = δ S δ g i j δ g i j + δ S δ A m δ A m d 3 x delta S=int[(delta S)/(deltag_(ij))deltag_(ij)+(delta S)/(deltaA_(m))deltaA_(m)]d^(3)x\delta S=\int\left[\frac{\delta S}{\delta g_{i j}} \delta g_{i j}+\frac{\delta S}{\delta A_{m}} \delta A_{m}\right] d^{3} xδS=[δSδgijδgij+δSδAmδAm]d3x
must vanish identically for arbitrary choice of the ξ i ( x , y , z ) ξ i ( x , y , z ) xi^(i)(x,y,z)\xi^{i}(x, y, z)ξi(x,y,z), which measure the equivalent of the sliding of a ruled transparent rubber sheet over an automobile fender.
Exercise 21.24. THE EXTREMAL ACTION ASSOCIATED WITH THE HILBERT ACTION PRINCIPLE DEPENDS ON CONFORMAL 3-GEOMETRY AND EXTRINSIC TIME [K. Kuchař (1972) and J. York (1972)]
Show that the data demanded by the Hilbert action principle δ ( 4 ) R ( ( 4 ) g ) 1 / 2 d 4 x = 0 δ ( 4 ) R ( 4 ) g 1 / 2 d 4 x = 0 deltaint^((4))R(-^((4))g)^(1//2)d^(4)x=0\delta \int^{(4)} R\left(-^{(4)} g\right)^{1 / 2} d^{4} x=0δ(4)R((4)g)1/2d4x=0 on each of the two bounding spacelike hypersurfaces consist of (1) the conformal 3-geometry ( 3 ) < ( 3 ) < ^((3)) <{ }^{(3)}<(3)< of the hypersurface plus (2) the extrinsic time variable defined by
τ = 2 3 g 1 / 2 Tr Π = 4 3 Tr K τ = 2 3 g 1 / 2 Tr Π = 4 3 Tr K tau=(2)/(3)g^(-1//2)Tr Pi=(4)/(3)Tr K\tau=\frac{2}{3} g^{-1 / 2} \operatorname{Tr} \boldsymbol{\Pi}=\frac{4}{3} \operatorname{Tr} \boldsymbol{K}τ=23g1/2TrΠ=43TrK
conveniently represented by the pictogram 2 2 2 2 2∼22 \sim 222, measured by one number per spacepoint, and independent of the conformal factor in the metric of the 3-geometry. This done, explain in a few words why in this formulation of geometrodynamics the Hamilton-Jacobi function ( \hbar times the phase of the wave function in the semiclassical or JWKB approximation) is appropriately expressed in the form
S = S ( ( 3 ) < , ) S = S ( 3 ) < , S=S(^((3)) < ,|--^(**))S=S\left({ }^{(3)}<, \vdash^{*}\right)S=S((3)<,)

§21.13. JUNCTION CONDITIONS

The intrinsic and extrinsic curvatures of a hypersurface, which played such fundamental roles in the initial-value formalism, are also powerful tools in the analysis of "junction conditions."
Recall the junction conditions of electrodynamics: across any surface (e.g., a capacitor plate), the tangential part of the electric field, E E E_(||)\boldsymbol{E}_{\|}E, and the normal part of the magnetic field, B B B_(_|_)\boldsymbol{B}_{\perp}B, must be continuous; thus,
[ E ] ( discontinuity in E ) ( E on " + " side of surface ) ( E on " " side of surface ) (21.161a) E + E = 0 , (21.161b) [ B ] B + B = 0 ; E  discontinuity in  E E on "  +  " side of surface  E on "   " side of   surface  ) (21.161a) E + E = 0 , (21.161b) B B + B = 0 ; {:[[E_(||)]-=(" discontinuity in "E_(||))],[-=(E_(||)"on " "+" " side of surface ")-(E_(||)"on " "-:}" " side of "],[" surface ")],[(21.161a)-=E_(||)^(+)-E_(||)^(-)=0","],[(21.161b)[B_(_|_)]-=B_(_|_)^(+)-B_(_|_)^(-)=0;]:}\begin{align*} {\left[\boldsymbol{E}_{\|}\right] } & \equiv\left(\text { discontinuity in } \boldsymbol{E}_{\|}\right) \\ & \equiv\left(\boldsymbol{E}_{\|} \text {on " }+ \text { " side of surface }\right)-\left(\boldsymbol{E}_{\|} \text {on " }-\right. \text { " side of } \\ & \text { surface }) \\ & \equiv \boldsymbol{E}_{\|}^{+}-\boldsymbol{E}_{\|}^{-}=0, \tag{21.161a}\\ {\left[\boldsymbol{B}_{\perp}\right] } & \equiv \boldsymbol{B}_{\perp}^{+}-\boldsymbol{B}_{\perp}^{-}=0 ; \tag{21.161b} \end{align*}[E]( discontinuity in E)(Eon " + " side of surface )(Eon "  " side of  surface )(21.161a)E+E=0,(21.161b)[B]B+B=0;
while the "jump" in the parts E E E_(_|_)\boldsymbol{E}_{\perp}E and B B B_(||∣)\boldsymbol{B}_{\| \mid}Bmust be related to the charge density (charge per unit area) σ σ sigma\sigmaσ, the current density (current per unit area) j j j\boldsymbol{j}j, and the unit normal to the surface n n n\boldsymbol{n}n by the formulas
(21.161c) [ E ] = E + E = 4 π σ n , (21.161d) [ B ] = B + B = 4 π j × n . (21.161c) E = E + E = 4 π σ n , (21.161d) B = B + B = 4 π j × n . {:[(21.161c)[E_(_|_)]=E_(_|_)^(+)-E_(_|_)^(-)=4pi sigma n","],[(21.161d)[B_(||)]=B_(||)^(+)-B_(||)^(-)=4pi j xx n.]:}\begin{gather*} {\left[\boldsymbol{E}_{\perp}\right]=\boldsymbol{E}_{\perp}^{+}-\boldsymbol{E}_{\perp}^{-}=4 \pi \sigma \boldsymbol{n},} \tag{21.161c}\\ {\left[\boldsymbol{B}_{\|}\right]=\boldsymbol{B}_{\|}^{+}-\boldsymbol{B}_{\|}^{-}=4 \pi j \times \boldsymbol{n} .} \tag{21.161d} \end{gather*}(21.161c)[E]=E+E=4πσn,(21.161d)[B]=B+B=4πj×n.
Recall also that one derives these junction conditions by integrating Maxwell's equations over a "pill box" that is centered on the surface.
Similar junction conditions, derivable in a similar manner, apply to the gravitational field (spacetime curvature), and to the stress-energy that generates it.* Focus
Junction conditions for electrodynamics
Figure 21.6.
Gaussian normal coordinates in the neighborhood of a 3 -surface Σ Σ Sigma\SigmaΣ. The metric in Gaussian normal coordinates has the form
d s 2 = ( n n ) 1 d n 2 + g i j d x i d x j d s 2 = ( n n ) 1 d n 2 + g i j d x i d x j ds^(2)=(n*n)^(-1)dn^(2)+g_(ij)dx^(i)dx^(j)d s^{2}=(\boldsymbol{n} \cdot \boldsymbol{n})^{-1} d n^{2}+g_{i j} d x^{i} d x^{j}ds2=(nn)1dn2+gijdxidxj
with n = / n , ( n n ) = 1 n = / n , ( n n ) = 1 n=del//del n,(n*n)=-1\boldsymbol{n}=\partial / \partial n,(\boldsymbol{n} \cdot \boldsymbol{n})=-1n=/n,(nn)=1 if the surface is spacelike, and ( n n ) = 1 ( n n ) = 1 (n*n)=1(\boldsymbol{n} \cdot \boldsymbol{n})=1(nn)=1 if it is timelike. (See exercise 27.2.) The extrinsic curvature of the surfaces n = n = n=n=n= constant is K i j = 1 2 g i j / n K i j = 1 2 g i j / n K_(ij)=-(1)/(2)delg_(ij)//del nK_{i j}=-\frac{1}{2} \partial g_{i j} / \partial nKij=12gij/n, and the Einstein field equations written in " 3 + 1 3 + 1 3+13+13+1 " form are (21.162).
attention on a specific three-dimensional slice through spacetime-the 3-surface Σ Σ Sigma\SigmaΣ on Figure 21.6. Let the surface be either spacelike [unit normal n n n\boldsymbol{n}n timelike; ( n n ) = 1 ( n n ) = 1 (n*n)=-1(\boldsymbol{n} \cdot \boldsymbol{n})=-1(nn)=1 ] or timelike [ n n n\boldsymbol{n}n spacelike; ( n n ) = + 1 ( n n ) = + 1 (n*n)=+1(\boldsymbol{n} \cdot \boldsymbol{n})=+1(nn)=+1 ]. The null case will be discussed later. As an aid in deriving junction conditions, introduce Gaussian normal coordinates in the neighborhood of Σ Σ Sigma\SigmaΣ [see the paragraph preceeding equation (21.82)]. In terms of the intrinsic and extrinsic curvatures of Σ Σ Sigma\SigmaΣ and of neighboring 3 -surfaces n = n = n=n=n= constant, the Einstein tensor and Einstein field equation have components
(21.162a) G n n = 1 2 ( ( 3 ) R + 1 2 ( n n ) 1 { ( Tr K ) 2 Tr ( K 2 ) } = 8 π T n n , (21.162b) G n i = ( n n ) 1 { K i m m ( Tr K ) i } = 8 π T n i , G i j = ( 3 ) G i j + ( n n ) 1 { ( K i j δ i j Tr K ) , n (21.162c) ( Tr K ) K i j + 1 2 δ i j ( Tr K ) 2 + 1 2 δ i j Tr ( K 2 ) } = 8 π T i j . (21.162a) G n n = 1 2 ( 3 ) R + 1 2 ( n n ) 1 ( Tr K ) 2 Tr K 2 = 8 π T n n , (21.162b) G n i = ( n n ) 1 K i m m ( Tr K ) i = 8 π T n i , G i j = ( 3 ) G i j + ( n n ) 1 K i j δ i j Tr K , n (21.162c) ( Tr K ) K i j + 1 2 δ i j ( Tr K ) 2 + 1 2 δ i j Tr K 2 = 8 π T i j . {:[(21.162a)G^(n)_(n)=-(1)/(2)(^((3))R+(1)/(2)(n*n)^(-1){(Tr K)^(2)-Tr(K^(2))}=8piT^(n)_(n),:}],[(21.162b)G^(n)_(i)=-(n*n)^(-1){K_(i)^(m)_(∣m)-(Tr K)_(∣i)}=8piT^(n)_(i)","],[G^(i)_(j)=^((3))G^(i)_(j)+(n*n)^(-1){(K^(i)_(j)-delta^(i)_(j)Tr K)_(,n):}],[(21.162c){:-(Tr K)K^(i)_(j)+(1)/(2)delta^(i)_(j)(Tr K)^(2)+(1)/(2)delta^(i)_(j)Tr(K^(2))}=8piT^(i)_(j).]:}\begin{align*} G^{n}{ }_{n}= & -\frac{1}{2}\left({ }^{(3)} R+\frac{1}{2}(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left\{(\operatorname{Tr} \boldsymbol{K})^{2}-\operatorname{Tr}\left(\boldsymbol{K}^{2}\right)\right\}=8 \pi T^{n}{ }_{n},\right. \tag{21.162a}\\ G^{n}{ }_{i}= & -(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left\{K_{i}{ }^{m}{ }_{\mid m}-(\operatorname{Tr} \boldsymbol{K})_{\mid i}\right\}=8 \pi T^{n}{ }_{i}, \tag{21.162b}\\ G^{i}{ }_{j}= & { }^{(3)} G^{i}{ }_{j}+(\boldsymbol{n} \cdot \boldsymbol{n})^{-1}\left\{\left(K^{i}{ }_{j}-\delta^{i}{ }_{j} \operatorname{Tr} \boldsymbol{K}\right)_{, n}\right. \\ & \left.-(\operatorname{Tr} \boldsymbol{K}) K^{i}{ }_{j}+\frac{1}{2} \delta^{i}{ }_{j}(\operatorname{Tr} \boldsymbol{K})^{2}+\frac{1}{2} \delta^{i}{ }_{j} \operatorname{Tr}\left(\boldsymbol{K}^{2}\right)\right\}=8 \pi T^{i}{ }_{j} . \tag{21.162c} \end{align*}(21.162a)Gnn=12((3)R+12(nn)1{(TrK)2Tr(K2)}=8πTnn,(21.162b)Gni=(nn)1{Kimm(TrK)i}=8πTni,Gij=(3)Gij+(nn)1{(KijδijTrK),n(21.162c)(TrK)Kij+12δij(TrK)2+12δijTr(K2)}=8πTij.
[See equations (21.77), (21.81), (21.76), and (21.82).]
Suppose that the stress-energy tensor T α β T α β T^(alpha)_(beta)T^{\alpha}{ }_{\beta}Tαβ contains a "delta-function singularity" at Σ Σ Sigma\SigmaΣ-i.e., suppose that Σ Σ Sigma\SigmaΣ is the "world tube" of a two-dimensional surface with finite 4 -momentum per unit area (analog of surface charge and surface current in electrodynamics). Then define the surface stress-energy tensor on Σ Σ Sigma\SigmaΣ to be the integral of T α β T α β T^(alpha)_(beta)T^{\alpha}{ }_{\beta}Tαβ with respect to proper distance ( n n nnn ), measured perpendicularly through Σ Σ Sigma\SigmaΣ :
(21.163) S α β = Lim ε 0 [ ε + ε T α β d n ] (21.163) S α β = Lim ε 0 ε + ε T α β d n {:(21.163)S^(alpha)_(beta)=Lim_(epsi rarr0)[int_(-epsi)^(+epsi)T^(alpha)_(beta)dn]:}\begin{equation*} S^{\alpha}{ }_{\beta}=\operatorname{Lim}_{\varepsilon \rightarrow 0}\left[\int_{-\varepsilon}^{+\varepsilon} T^{\alpha}{ }_{\beta} d n\right] \tag{21.163} \end{equation*}(21.163)Sαβ=Limε0[ε+εTαβdn]
To discover the effect of this surface layer on the spacetime geometry, perform a "pill-box integration" of the Einstein field equation (21.162)
(21.164) Lim ε 0 [ ε + ε G α β d n ] = 8 π S α β . (21.164) Lim ε 0 ε + ε G α β d n = 8 π S α β . {:(21.164)Lim_(epsi rarr0)[int_(-epsi)^(+epsi)G^(alpha)_(beta)dn]=8piS^(alpha)_(beta).:}\begin{equation*} \operatorname{Lim}_{\varepsilon \rightarrow 0}\left[\int_{-\varepsilon}^{+\varepsilon} G^{\alpha}{ }_{\beta} d n\right]=8 \pi S^{\alpha}{ }_{\beta} . \tag{21.164} \end{equation*}(21.164)Limε0[ε+εGαβdn]=8πSαβ.
Examine the integral of G α β G α β G^(alpha)_(beta)G^{\alpha}{ }_{\beta}Gαβ. If the 3 -metric g i j g i j g_(ij)g_{i j}gij were to contain a delta function or a discontinuity at Σ Σ Sigma\SigmaΣ, then Σ Σ Sigma\SigmaΣ would not have any well-defined 3-geometry-a physically inadmissible situation, even in the presence of surface layers. Absence of delta functions, δ ( n ) δ ( n ) delta(n)\delta(n)δ(n), in g i j g i j g_(ij)g_{i j}gij means absence of delta functions in ( 3 ) R ( 3 ) R ^((3))R{ }^{(3)} R(3)R; absence of discontinuities in g i j g i j g_(ij)g_{i j}gij means absence of delta functions in K i j = 1 2 g i j , n K i j = 1 2 g i j , n K_(ij)=-(1)/(2)g_(ij,n)K_{i j}=-\frac{1}{2} g_{i j, n}Kij=12gij,n. Thus, equations (21.162) when integrated say
(21.165a) G n n d n = 0 = 8 π S n n , (21.165b) G n i d n = 0 = 8 π S n , (21.165c) G i i d n = ( n n ) ( γ j i δ i j Tr γ ) = 8 π S i , (21.165a) G n n d n = 0 = 8 π S n n , (21.165b) G n i d n = 0 = 8 π S n , (21.165c) G i i d n = ( n n ) γ j i δ i j Tr γ = 8 π S i , {:[(21.165a)intG_(n)^(n)dn=0=8piS_(n)^(n)","],[(21.165b)intG^(n)_(i)dn=0=8piS^(n)","],[(21.165c)intG_(i)^(i)*dn=(n*n)(gamma_(j)^(i)-delta^(i)_(j)Tr gamma)=8piS^(i)","]:}\begin{gather*} \int G_{n}^{n} d n=0=8 \pi S_{n}^{n}, \tag{21.165a}\\ \int G^{n}{ }_{i} d n=0=8 \pi S^{n}, \tag{21.165b}\\ \int G_{i}^{i} \cdot d n=(\boldsymbol{n} \cdot \boldsymbol{n})\left(\gamma_{j}^{i}-\delta^{i}{ }_{j} \operatorname{Tr} \boldsymbol{\gamma}\right)=8 \pi S^{i}, \tag{21.165c} \end{gather*}(21.165a)Gnndn=0=8πSnn,(21.165b)Gnidn=0=8πSn,(21.165c)Giidn=(nn)(γjiδijTrγ)=8πSi,
where γ i j γ i j gamma^(i)_(j)\gamma^{i}{ }_{j}γij is the "jump" in the components of the extrinsic curvature
(21.166) γ [ K ] ( K on " n = + ε side" of Σ ) ( K on " n = ε side" of Σ ) K + K . (21.166) γ [ K ] ( K  on "  n = + ε  side" of  Σ ) ( K  on "  n = ε  side" of  Σ ) K + K . {:[(21.166)gamma-=[K]-=(K" on " "n=+epsi" side" of "Sigma)-(K" on " "n=-epsi" side" of "Sigma)],[-=K^(+)-K^(-).]:}\begin{align*} \boldsymbol{\gamma} & \equiv[\boldsymbol{K}] \equiv(\boldsymbol{K} \text { on " } n=+\varepsilon \text { side" of } \boldsymbol{\Sigma})-(\boldsymbol{K} \text { on " } n=-\varepsilon \text { side" of } \Sigma) \tag{21.166}\\ & \equiv \boldsymbol{K}^{+}-\boldsymbol{K}^{-} . \end{align*}(21.166)γ[K](K on " n=+ε side" of Σ)(K on " n=ε side" of Σ)K+K.
In the absence of a delta-function surface layer, the above junction conditions say, simply, that γ [ K ] = 0 γ [ K ] = 0 gamma-=[K]=0\boldsymbol{\gamma} \equiv[\boldsymbol{K}]=0γ[K]=0. In words: if one examines how Σ Σ Sigma\boldsymbol{\Sigma}Σ is embedded in the spacetime above its "upper"face, and how it is embedded in the spacetime below its "lower" face, one must discover identical embeddings-i.e., identical extrinsic curvatures K K K\boldsymbol{K}K. Of course, the intrinsic curvature of Σ Σ Sigma\boldsymbol{\Sigma}Σ must also be the same, whether viewed from above or below. More briefly:
(21.167) (absence of surface layers) ("continuity" of g i j and K i j ). (21.167)  (absence of surface layers)   ("continuity" of  g i j  and  K i j  ).  {:(21.167)" (absence of surface layers) "⇄" ("continuity" of "g_(ij)" and "K_(ij)" ). ":}\begin{equation*} \text { (absence of surface layers) } \rightleftarrows \text { ("continuity" of } g_{i j} \text { and } K_{i j} \text { ). } \tag{21.167} \end{equation*}(21.167) (absence of surface layers)  ("continuity" of gij and Kij ). 
If a surface layer is present, then Σ Σ Sigma\SigmaΣ must be the world tube of a two-dimensional layer of matter, and the normal to Σ Σ Sigma\SigmaΣ must be spacelike, ( n n ) = + 1 ( n n ) = + 1 (n*n)=+1(\boldsymbol{n} \cdot \boldsymbol{n})=+1(nn)=+1. The junction conditions ( 21.165 a , b ) ( 21.165 a , b ) (21.165a,b)(21.165 \mathrm{a}, \mathrm{b})(21.165a,b) then have the simple physical meaning
(21.168a) S ( n , ) = 0 ( the momentum flow is entirely in Σ ; i.e., no momentum associated with the surface layer flows out of Σ ; i.e., Σ is the world tube of the surface layer ) , (21.168a) S ( n , ) = 0  the momentum flow is entirely in  Σ  i.e., no momentum associated with the   surface layer flows out of  Σ ; i.e.,  Σ  is the world tube of the surface layer  , {:(21.168a)S(n","dots)=0⇆([" the momentum flow is entirely in "Sigma"; "],[" i.e., no momentum associated with the "],[" surface layer flows out of "Sigma"; i.e., "Sigma],[" is the world tube of the surface layer "])",":}\boldsymbol{S}(\boldsymbol{n}, \ldots)=0 \leftrightarrows\left(\begin{array}{l} \text { the momentum flow is entirely in } \boldsymbol{\Sigma} \text {; } \tag{21.168a}\\ \text { i.e., no momentum associated with the } \\ \text { surface layer flows out of } \boldsymbol{\Sigma} \text {; i.e., } \Sigma \\ \text { is the world tube of the surface layer } \end{array}\right),(21.168a)S(n,)=0( the momentum flow is entirely in Σ i.e., no momentum associated with the  surface layer flows out of Σ; i.e., Σ is the world tube of the surface layer ),
which tells one nothing new. The junction condition ( 21.165 c ) ( 21.165 c ) (21.165c)(21.165 \mathrm{c})(21.165c) says that the surface stress-energy generates a discontinuity in the extrinsic curvature (different embedding in spacetime "above" Σ Σ Sigma\SigmaΣ than "below" Σ Σ Sigma\SigmaΣ ), given by
(21.168b) γ i j δ i j Tr γ = 8 π S i j . (21.168b) γ i j δ i j Tr γ = 8 π S i j . {:(21.168b)gamma^(i)_(j)-delta^(i)_(j)Tr gamma=8piS^(i)_(j).:}\begin{equation*} \gamma^{i}{ }_{j}-\delta^{i}{ }_{j} \operatorname{Tr} \boldsymbol{\gamma}=8 \pi S^{i}{ }_{j} . \tag{21.168b} \end{equation*}(21.168b)γijδijTrγ=8πSij.
Of course, the intrinsic geometry of Σ Σ Sigma\SigmaΣ must be the same as seen from above and below,
(21.169) g i j continuous across Σ (21.169) g i j  continuous across  Σ {:(21.169)g_(ij)" continuous across "Sigma:}\begin{equation*} g_{i j} \text { continuous across } \Sigma \tag{21.169} \end{equation*}(21.169)gij continuous across Σ
Derivation of junction conditions
Junction conditions in absence of surface layers
Junction conditions for a surface layer
Equation of motion for a surface layer
Gravitational-wave shock fronts
In analyzing surface layers, one uses not only the junction conditions (21.168a) to (21.169), but also the four-dimensional Einstein field equation applied on each side of the surface Σ Σ Sigma\SigmaΣ separately, and also an equation of motion for the surface stressenergy. The equation of motion is derived by examining the jump in the field equation G n i = 8 π T n i G n i = 8 π T n i G^(n)_(i)=8piT^(n)_(i)G^{n}{ }_{i}=8 \pi T^{n}{ }_{i}Gni=8πTni (equation 21.162b); thus [ G n i ] = 8 π [ T n i ] G n i = 8 π T n i [G^(n)_(i)]=8pi[T^(n)_(i)]\left[G^{n}{ }_{i}\right]=8 \pi\left[T^{n}{ }_{i}\right][Gni]=8π[Tni] says
( γ i m δ i m Tr γ ) m = 8 π [ T n i ] ; γ i m δ i m Tr γ m = 8 π T n i ; (gamma_(i)^(m)-delta_(i)^(m)Tr gamma)_(∣m)=-8pi[T^(n)_(i)];\left(\gamma_{i}{ }^{m}-\delta_{i}{ }^{m} \operatorname{Tr} \boldsymbol{\gamma}\right)_{\mid m}=-8 \pi\left[T^{n}{ }_{i}\right] ;(γimδimTrγ)m=8π[Tni];
and when reexpressed in terms of S i m S i m S_(i)^(m)S_{i}{ }^{m}Sim by means of the junction condition (21.168b), it says
(21.170) S i m m + [ T i n ] = 0 . (21.170) S i m m + T i n = 0 . {:(21.170)S^(im)_(∣m)+[T^(in)]=0.:}\begin{equation*} S^{i m}{ }_{\mid m}+\left[T^{i n}\right]=0 . \tag{21.170} \end{equation*}(21.170)Simm+[Tin]=0.
[For intuition into this equation of motion, see Exercises 21.25 and 21.26. For applications of the "surface-layer formalism" see exercise 21.27; also Israel (1966), Kuchař (1968), Papapetrou and Hamoui (1968).]
When one turns attention to junction conditions across a null surface Σ Σ Sigma\SigmaΣ, one finds results rather different from those in the spacelike and timelike cases. A "pill-box" integration of the field equations reveals that even in vacuum the extrinsic curvature may be discontinuous. A discontinuity in K i j K i j K_(ij)K_{i j}Kij across a null surface, without any stress-energy to produce it, is the geometric manifestation of a gravitational-wave shock front (analog of a shock-front in hydrodynamics). For quantitative details see, e.g., Pirani (1957), Papapetrou and Treder (1959, 1962), Treder (1962), and especially Choquet-Bruhat (1968b).
That a discontinuity in the curvature tensor can propagate with the speed of light is a reminder that all gravitational effects, like all electromagnetic effects, obey a causal law. The initial-value data on a spacelike initial-value hypersurface uniquely determine the resulting spacetime geometry [see the work of Cartan, Stellmacher, Lichnerowicz, and Bruhat (also under the names Fourès-Bruhat and Choquet-Bruhat) and others cited and summarized in the article of Bruhat (1962)] but determine it in a way consistent with causality. Thus a change in these data throughout a limited region of the initial value 3-geometry makes itself felt on a slightly later hypersurface solely in a region that is also limited, and only a little larger than the original region.
When one turns from classical dynamics to quantum dynamics, one sees new reason to focus attention on a spacelike initial-value hypersurface: the observables at different points on such a hypersurface commute with one another; i.e., are in principle simultaneously observable.
Not every four-dimensional manifold admits a global singularity-free spacelike hypersurface. Those manifolds that do admit such a hypersurface have more to do with physics, it is possible to believe, than those that do not.
Even in a manifold that does admit a spacelike hypersurface, attention has been given sometimes, in the context of classical theory, to initial-value data on a hypersurface that is not spacelike but "characteristic," in the sense that it accomodates null geodesics [see, for example, Sachs (1964) and references cited there]. It is typical in such situations that one can predict the future but not the past, or predict the past but not the future.
Children of light and children of darkness is the vision of physics that emerges from this chapter, as from other branches of physics. The children of light are the differential equations that predict the future from the present. The children of darkness are the factors that fix these initial conditions.

Exercise 21.25. EQUATION OF MOTION FOR A SURFACE LAYER

(a) Let u u u\boldsymbol{u}u be the "mean 4 -velocity" of the matter in a surface layer-so defined that an observer moving with 4 -velocity u u u\boldsymbol{u}u sees zero energy flux. Let σ σ sigma\sigmaσ be the total mass-energy per unit proper surface area, as measured by such a "comoving observer." Show that the surface stress-energy tensor can be expressed in the form
(21.171) S = σ u u + t , where ( t u ) = 0 (21.171) S = σ u u + t ,  where  ( t u ) = 0 {:(21.171)S=sigma u ox u+t","" where "(t*u)=0:}\begin{equation*} \boldsymbol{S}=\sigma \boldsymbol{u} \otimes \boldsymbol{u}+\boldsymbol{t}, \text { where }(\boldsymbol{t} \cdot \boldsymbol{u})=0 \tag{21.171} \end{equation*}(21.171)S=σuu+t, where (tu)=0
and where t t t\boldsymbol{t}t is a symmetric stress tensor.
(b) Show that the component along u u u\boldsymbol{u}u of the equation of motion (21.170) is
(21.172) d σ / d τ = σ u j j + u j t j k k + u j [ T j n ] (21.172) d σ / d τ = σ u j j + u j t j k k + u j T j n {:(21.172)d sigma//d tau=-sigmau^(j)_(∣j)+u_(j)t^(jk)_(∣k)+u_(j)[T^(jn)]:}\begin{equation*} d \sigma / d \tau=-\sigma u^{j}{ }_{\mid j}+u_{j} t^{j k}{ }_{\mid k}+u_{j}\left[T^{j n}\right] \tag{21.172} \end{equation*}(21.172)dσ/dτ=σujj+ujtjkk+uj[Tjn]
where d / d τ = u d / d τ = u d//d tau=ud / d \tau=\boldsymbol{u}d/dτ=u. Give a physical interpretation for each term.
(c) Let a j a j a_(j)a_{j}aj be that part of the 4 -acceleration of the comoving observer which lies in the surface layer Σ Σ Sigma\SigmaΣ. By projecting the equation of motion (21.170) perpendicular to u u u\boldsymbol{u}u, show that
(21.173) σ a j = P j a { t a b b + [ T a n ] } , (21.173) σ a j = P j a t a b b + T a n , {:(21.173)sigmaa_(j)=-P_(ja){t^(ab)_(∣b)+[T^(an)]}",":}\begin{equation*} \sigma a_{j}=-P_{j a}\left\{t^{a b}{ }_{\mid b}+\left[T^{a n}\right]\right\}, \tag{21.173} \end{equation*}(21.173)σaj=Pja{tabb+[Tan]},
where P j a P j a P_(ja)P_{j a}Pja is the projection operator
(21.174) P j a = g j a + u j u a (21.174) P j a = g j a + u j u a {:(21.174)P_(ja)=g_(ja)+u_(j)u_(a):}\begin{equation*} P_{j a}=g_{j a}+u_{j} u_{a} \tag{21.174} \end{equation*}(21.174)Pja=gja+ujua
Give a physical interpretation for each term of equation (21.182).

Exercise 21.26. THIN SHELLS OF DUST

For a thin shell of dust surrounded by vacuum ( [ T j n ] = 0 , t = 0 T j n = 0 , t = 0 [T^(jn)]=0,t=0\left[T^{j n}\right]=0, \boldsymbol{t}=0[Tjn]=0,t=0 ), derive the following equations
(21.175a) d σ / d τ = σ u b b , (21.175b) a + + a = 0 , (21.175c) a + a = ( 4 π σ ) n (21.175~d) γ = 8 π σ ( u u + 1 2 g ) (21.175a) d σ / d τ = σ u b b , (21.175b) a + + a = 0 , (21.175c) a + a = ( 4 π σ ) n (21.175~d) γ = 8 π σ u u + 1 2 g {:[(21.175a)d sigma//d tau=-sigmau^(b)_(∣b)","],[(21.175b)a^(+)+a^(-)=0","],[(21.175c)a^(+)-a^(-)=(4pi sigma)n],[(21.175~d)gamma=8pi sigma(u ox u+(1)/(2)g)]:}\begin{gather*} d \sigma / d \tau=-\sigma u^{b}{ }_{\mid b}, \tag{21.175a}\\ \boldsymbol{a}^{+}+\boldsymbol{a}^{-}=0, \tag{21.175b}\\ \boldsymbol{a}^{+}-\boldsymbol{a}^{-}=(4 \pi \sigma) \boldsymbol{n} \tag{21.175c}\\ \boldsymbol{\gamma}=8 \pi \sigma\left(\boldsymbol{u} \otimes \boldsymbol{u}+\frac{1}{2} \boldsymbol{g}\right) \tag{21.175~d} \end{gather*}(21.175a)dσ/dτ=σubb,(21.175b)a++a=0,(21.175c)a+a=(4πσ)n(21.175~d)γ=8πσ(uu+12g)
Here a + a + a^(+)\boldsymbol{a}^{+}a+and a a a^(-)\boldsymbol{a}^{-}aare the 4-accelerations as measured by accelerometers that are fastened onto the outer and inner sides of the shell, and g g g\boldsymbol{g}g is the 3-metric of the shell. Show that the first of these equations is the law of "conservation of rest mass."

Exercise 21.27. SPHERICAL SHELL OF DUST

Apply the formalism of exercise 21.25 to a collapsing spherical shell of dust [Israel (1967b)]. For the metric inside and outside the shell, take the flat-spacetime and vacuum Schwarzschild expressions (Chapter 23),
(21.176a) d s 2 = d t 2 + d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) inside (21.176b) d s 2 = ( 1 2 M r ) d t 2 + d r 2 1 2 M / r + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) outside. (21.176a) d s 2 = d t 2 + d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2  inside  (21.176b) d s 2 = 1 2 M r d t 2 + d r 2 1 2 M / r + r 2 d θ 2 + sin 2 θ d ϕ 2  outside.  {:[(21.176a)ds^(2)=-dt^(2)+dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))" inside "],[(21.176b)ds^(2)=-(1-(2M)/(r))dt^(2)+(dr^(2))/(1-2M//r)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))" outside. "]:}\begin{gather*} d s^{2}=-d t^{2}+d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \text { inside } \tag{21.176a}\\ d s^{2}=-\left(1-\frac{2 M}{r}\right) d t^{2}+\frac{d r^{2}}{1-2 M / r}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right) \text { outside. } \tag{21.176b} \end{gather*}(21.176a)ds2=dt2+dr2+r2(dθ2+sin2θdϕ2) inside (21.176b)ds2=(12Mr)dt2+dr212M/r+r2(dθ2+sin2θdϕ2) outside. 
Let the "radius" of the shell, as a function of proper time measured on the shell, be
(21.176c) R 1 2 π × ( proper circumference of shell ) = R ( τ ) (21.176c) R 1 2 π × (  proper circumference of shell  ) = R ( τ ) {:(21.176c)R-=(1)/(2pi)xx(" proper circumference of shell ")=R(tau):}\begin{equation*} R \equiv \frac{1}{2 \pi} \times(\text { proper circumference of shell })=R(\tau) \tag{21.176c} \end{equation*}(21.176c)R12π×( proper circumference of shell )=R(τ)
Show that the shell's mass density varies with time as
(21.176d) σ ( τ ) = μ / 4 π R 2 ( τ ) , μ = constant = "total rest mass"; (21.176d) σ ( τ ) = μ / 4 π R 2 ( τ ) , μ =  constant  =  "total rest mass";  {:(21.176d)sigma(tau)=mu//4piR^(2)(tau)","quad mu=" constant "=" "total rest mass"; ":}\begin{equation*} \sigma(\tau)=\mu / 4 \pi R^{2}(\tau), \quad \mu=\text { constant }=\text { "total rest mass"; } \tag{21.176d} \end{equation*}(21.176d)σ(τ)=μ/4πR2(τ),μ= constant = "total rest mass"; 
and derive and solve the equation of motion
(21.176e) M = μ { 1 + ( d R d τ ) 2 } 1 / 2 μ 2 R (21.176e) M = μ 1 + d R d τ 2 1 / 2 μ 2 R {:(21.176e)M=mu{1+((dR)/(d tau))^(2)}^(1//2)-(mu)/(2R):}\begin{equation*} M=\mu\left\{1+\left(\frac{d R}{d \tau}\right)^{2}\right\}^{1 / 2}-\frac{\mu}{2 R} \tag{21.176e} \end{equation*}(21.176e)M=μ{1+(dRdτ)2}1/2μ2R

  1. *This thought experiment was devised by Bondi [1957, 1965; Bondi and McCrea (1960)] as a means for convincing skeptics of the reality of gravitational waves.
  2. *Stokes (1887) and other standard references deny this legend. In part I of Stokes the basic manuscript references are listed, including especially codex manuscript Rawlinson B. 512 in 154 folios, in double columns, written by various hands in the fourteenth and fifteenth centuries (cf. Catalogi codicum manuscriptorum Bibliothecae Bodleianae Partis Quintae Fasciculus Primus, Oxford, 1862, col. 728-732). In this manuscript, folio 97b.1, line 14, reads in the translation of Stokes, Part I, p. xxx: "as Paradise is without beasts, without a snake, without a lion, without a dragon, without a scorpion, without a mouse, without a frog, so is Ireland in the same manner without any harmful animal, save only the wolf. . ."
  3. To Karel Kuchař, Claudio Teitelboim, and James York go warm thanks for their collaboration in the preparation of this chapter, and for permission to draw on the lecture notes of K. K. and to quote results of K. K. [especially exercise 21.10] and of J. Y. [especially equations (21.87), (21.88), and (21.152)] prior to publication elsewhere.
    • Historical remark. No one knew until recently what coordinate-free geometric-physical quantity really is fixed at limits in the Hilbert-Palatini variational principle. In his pioneering work on the Hamiltonian formulation of general relativity, Dirac paid no particular attention to any variational principle. He had to generalize the Hamiltonian formalism to accommodate it to general relativity, introducing "first- and second-class constraints" and generalizations of the Poisson brackets of classical mechanics. The work of Arnowitt, Deser, and Misner, by contrast, took the variational principle as the foundation for the whole treatment, even though they too did not ask what it is that is fixed at limits in the sense of
  4. ^(**){ }^{*} Here Sachs' equation (10) is generalized to the case where the unit normal n n n\boldsymbol{n}n is not necessarily timelike. Sachs used n = / t n = / t n=del//del t\boldsymbol{n}=\partial / \partial tn=/t.
    • The original formulation of gravitational junction conditions stemmed from Lanczos ( 1922 , 1924 ) ( 1922 , 1924 ) (1922,1924)(1922,1924)(1922,1924). The formulation given here, in terms of intrinsic and extrinsic curvature, was developed by Darmois (1927), Misner and Sharp (1964), and Israel (1966). For further references to the extensive literature, see Israel.